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CAR RECEPTORS 
AND RELATED MOLECT l.ES AND METHODS 

5 Background of the Invention 

This application is a continuation-in-part of U.S. S.N. 07/843,350, 
filed February 26, 1992. This invention relates to receptors, particularly 
nuclear hormone receptors. 

In higher organisms, the nuclear hormone receptor superfamily 

10 includes approximately a dozen distinct genes that encode zinc finger 

transcription factors, each of which is specifically activated by binding a lieand 
such as a steroid, thyroid homione (T3) or retinoic acid (RA). However, there 
is an additional, somewhat larger group of cDNAs that encode proteins that do 
not bind or respond to any known ligand. These members of the superfamily 

15 are called orphan receptors. While the role of the better characterized 

conventional receptors in regulating important processes in developing and 
adult individuals is becoming clearer, the function of the orphan receptors has 
been uncertain. 

A number of the conventional and orphan members of the 

20 superfamily share identical or very similar amino acid sequences in an 

important region of the first zinc finger. Both genetic analyses and X-ray 
crystallography indicate that this region, termed the P box, makes sequence 
specific contacts with the DNA. The conventional receptors in this P box- 
defined subgroup include those that bind estrogen, vitamin D, T3 and RA, and 

25 nearly all of the orphan receptors identified to date also fall into this class. As a 
consequence of this overlap in binding specificity, many hormone response 
elements can bind more than one type of receptor. The best characterized of 
these is the element upstream of the rat growth hormone gene, which can be 



activated by three different isoforms of the T3 receptor encoded by two 
different genes and by an unknown number of retinoic acid receptor isoforms 
encoded by three different genes. While it does not appear to respond to the 
estrogen receptor or the vitamin D receptor, its response to other members of 
the subgroup remains uncertain. 

Recently the potential complexity of the interactions of the 
conventional receptors with their response elements has been substantially 
increased by the demonstration that the three closely related RXR proteins can 
form heterodimers with the thyroid hormone, retinoic acid and vitamin D 
receptors. These heterodimers show higher binding affinity for appropriate 
response elements, and the RXRs are hypothesized to play central roles in 
signal transduction by all three classes of receptors. The impact on such 
heterodimers of the binding of the retinoid metabolite 9-cis retinoic acid by the 
RXRs remains unclear. 

Summary of the Invention 

In general, the invention features substantially pure CAR receptor 
polypeptide. Preferably, such a receptor polypeptide includes an amino acid 
sequence substantially identical to the amino acid sequence shown in Fig. 1 
(SEQ ID NO: 10) or Fig. 2 (SEQ ID NO:12). 

The invention further features a substantially pure polypeptide which 
includes a CAR receptor DNA binding domain and a CAR receptor gene 
activation domain (preferably, also including the ligand binding domain). 
Preferably, the DNA binding domain includes a sequence substantially 
identical to amino acids 1 1-75 of Fig. 1 (SEQ ID NO: 10) or amino acids 21-86 
of Fig. 2 (SEQ ID NO: 12), or a DNA binding fragment thereof; and the gene 
activation or ligand binding/gene activation domain includes a sequence 
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substantially identical to ammo acids 76-348 of Fig. 1 (SEQ ID NO: 10) or 
amino acids 182-358 or 352-357 of Fig. 2 (SEQ ID NO:12), or a gene 
activating fragment thereof. 

In a related aspect, the invention features a substantially pure 
5 polypeptide which includes a CAR receptor heterodimerization domain. 

In preferred embodiments of various aspects, the receptor 
polypeptide is mammalian, and preferably, human or murine. 

In yet other aspects, the invention features substantially pure DNA 
which encodes a CAR receptor polypeptide of the invention. Preferably, such 
10 DNA i=; rONA' and enr,nHe<; a human nr mnT-in^^' PAP r<=*r^^-ntr*r T-k<^Kn-i*»T-»+i/-l*» 

The invention also features a vector which includes such substantially pure 
DNA and which is capable of directing expression of the protein encoded by 
the DNA in a vector-containing cell. Finally, the invention features a cell 
which contains the substantially pure DNA. Preferably, the cell is a eukaryotic 

1 5 cell, for example, a mammalian cell. 

In another aspect, the invention features a method of producing a 
recombinant CAR receptor polypeptide (or a fragment or analog thereof). The 
method involves (a) providing a cell transformed with DNA encoding a CAR 
receptor or a fragment or analog thereof positioned for expression is the cell; 

20 (b) culturing the transformed cell under conditions for expressing the DNA; 
and (c) isolating the recombinant CAR receptor polypeptide. 

In yet another aspect, the invention features a substantially pure 
antibody which specifically binds a CAR receptor polypeptide of the invention. 

In yet other aspects, the invention features therapeutic compositions 

25 which include as an active ingredient a CAR receptor polypeptide of the 

invention formulated in a physiologically-acceptable earner. Such therapeutic 
compositions may be used in methods of treating Graves' disease or cancer (for 



example, lung cancer) in a mammal; such methods involve administering the 
therapeutic composition to the mammal in a dosage effective to decrease 
thyroid hormone receptor function (for the treatment of Gr^^ves* disease) or in a 
dosage effective to increase retmoic acid receptor expression (for the treatment 
of cancer). 

In yet other aspects, the invention features methods of identifying a 
CAR ligand. One method involves (a) providing a nucleic acid sequence which 
encodes a CAR receptor polypeptide; (b) introducing into a host cell which is 
functionally deficient for CAR receptor (i) the nucleic acid which encodes the 
CAR receptor polypeptide (preferably, a CAR receptor polypeptide including 
an amino acid sequence substantially identical to the amino acid sequence 
shown in Fig. 1; SEQ ID NO: 10 or Fig. 2; SEQ ID NO: 12) and (ii) a reporter 
gene operably linked to a CAR receptor polypeptide binding site (preferably, 
the binding site GGGTAGGGTTCACCGAAAGTTCACTCG; SEQ ID NO: 5); 
(c) measuring induction of the reporter gene in the transfected host cell; (d) 
contacting the transfected host cell with a candidate ligand; and (e) measunng 
induction of said reporter gene in the presence of the candidate ligand, an 
increase or decrease in the induction as compared to the induction in (c) being 
indicative of the presence of a CAR ligand. 

The second method involves (a) providing a nucleic acid sequence 
which encodes a CAR receptor polypeptide (preferably, mcluding an amino 
acid sequence substantially identical to the amino acid sequence shown in Fig. 
1; SEQ ID NO: 10 or Fig. 2; SEQ ID NO: 12); (b) introducing the nucleic acid 
into a host cell so that the recombinant CAR receptor polypeptide is expressed; 
(c) isolating the recombinant protein; (d) immobilizing the recombinant protein 
on a solid substrate (preferably, a column); (e) contacting the immobilized 
recombinant protein with a candidate ligand under conditions which allow 
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formation of an affinity complex between the immobilized recombinant CAR 
receptor polypeptide and the candidate ligand; and (f) detecting complex 
formation as an indication of the presence of a CAR ligand. 

In yet other aspects, the invention features methods of identifying a 
5 CAR receptor DNA binding site. One method involves (a) providing a nucleic 
acid sequence which encodes a CAR receptor polypeptide (preferably including 
an amino acid sequence substantially identical to the amino acid sequence 
shown in Fig. 1; SEQ ID NO: lOorFig. 2; SEQ IDNO:12;); (b) introducing 
into a host cell which is functionally deficient for CAR receptor (i) the nucleic 

which is operably linked to a candidate CAR receptor DNA binding site; and 
( c) measuring induction of the reporter gene in the transfected host cell, 
induction being indicative of the presence of an operably linked CAR receptor 
DNA binding site. 

15 A second method involves (a) providing a nucleic acid sequence 

which encodes a CAR receptor polypeptide {preferably, including an ammo 
acid sequence substantially identical to the amino acid sequence shown in Fig. 
1; SEQ ID NO: 10 or Fig. 2; SEQ ID NO:12); (b) introducing the nucleic acid 
into a host cell so that the recombinant CAR receptor polypeptide is expressed; 

20 ( c) isolating the recombinant protein; (d) contacting the recombinant protein 
with a candidate DNA binding site under conditions which allow formation of 
an affinity complex between the recombinant CAR receptor polypeptide and 
the candidate binding site; and (e) detecting complex formation as an indication 
of the presence of a CAR receptor DNA binding site. 

25 In a ilnal aspect, the invention features chimeric receptors. Such 

chimeric receptors may include the DNA binding domain of a CAR receptor 
polypeptide (preferably, including a sequence substantially identical to amino 
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acids 11-75 of Fig. 1; SEQ ID NO: 10 or amino acids 21-86 of Fig. 2 SEQ ID 
NO: 12, or a DNA binding fragment thereof) fused to the gene activation (and, 
preferably, the ligand binding domain ) of a heterolo20us Drotein nrpfemhK/ ^ 
nuclear hormone receptor, or a protein chosen from the group consisting of: 
5 glucocorticoid receptor, a-retinoic acid receptor, P-retinoic acid receptor, y- 
retinoic acid receptor, estrogen receptor, progesterone receptor, vitamin D 
receptor, mineralocorticoid receptor, thyroid receptor, VP 16, and GAL4; or the 
chimeric receptor may include the gene activation domain of a CAR receptor 
polypeptide (preferably, including a sequence substantially identical to amino 

10 acids 76-348 of Fig. 1; SEQ ID NO: 10 or amino acids 182-358 of Fig. 2; SEQ 
ID NO: 12, or a gene activating fragment thereof) fused to the DNA binding 
domain of a heterologous protein, preferably, a nuclear hormone receptor or a 
protein chosen from the group consisting of: glucocorticoid receptor, a-retinoic 
acid receptor, P-retinoic acid receptor, y-retinoic acid receptor, estrogen 

1 5 receptor, progesterone receptor, vitamin D receptor, mineralocorticoid receptor, 
thyroid receptor, and GAL4. 

By "CAR receptor polypeptide" is meant a polypeptide which is 
capable of binding to a DNA sequence of 

GGGTAGGGTTCACCGAAAGTTCACTCG (SEQ ID NO: 5) and activating 
10 the expression of downstream genes, even when mammalian cells harboring the 
receptor are grown in medium containing charcoal-stripped serum; in 
particular, a CAR receptor polypeptide activates such gene expression in the 
absence of retinoic acid. 

By "substantially pure" is meant that the CAR receptor polypeptide 
:5 provided by the invention is at least 60%, by weight, free fVom the proteins and 
naturally-occurring organic molecules with which it is naturally associated. 
Preferably, the preparation is at least 75%, more preferably at least 90%, and 
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most preferably at least 99?o, by weight, CAR receptor polypeptide, A 
substantially pure CAR receptor polypeptide may be obtained, for example, by 
extraction from a natural source (e.g., a mammalian liver cell); by expression of 
a recombinant nucleic acid encoding a CAR receptor poKpeptide, or by 
5 chemically synthesizing the protein. Purity can be measured by any 
appropriate method, e.g., column chromatography, polyacrylamide gel 
electrophoresis, or HPLC analysis. By a "polypeptide" is meant any chain of 
amino acids, regardless of length or post-translational modification (e.g., 
glycosylation) 

1 ^ By "substantially identical" is meant an ammo acid sequence whuSii 

diiTers only by conservative amino acid substitutions, for example, substitution 
of one amino acid for another of the same class (e.g., valine for glycine, 
arginine for lysine, etc.) or by one or more non-conservative substitutions, 
deletions, or insertions located at positions of the amino acid sequence which 

15 do not destroy the function of the protein or domain (assayed, e.g., as described 
herein). Preferably, the amino acid sequence is at least 60%, preferably^ 70%^ 
more preferably, 85%, and, most preferably, 95% identical to the sequence of 
either SEQ ID NO: 10 or SEQ ID NO: 12. A "substantially identical" nucleic 
acid sequence codes for a substantially identical amino acid sequence as 

20 defined above. 

By a "DNA binding domain" is meant a stretch of amino acids which 
is capable of directing specific polypeptide (e.g., receptor) binding to a 
particular DNA sequence. 

By "gene activation domain" is meant a stretch of amino acids which 

25 IS capable of inducing the expression of a gene to whose control region it is 
bound. 
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By "heterodimerization domain" is meant a stretch of amino acids 
which is capable of directing specific complex formation with a heterologous 
protein; such a domain may direct the formation of dimers. trimers. tetramers. 
or other higher order hetero-oligomers. 
5 By "substantially pure DNA" is meant DNA that is free of the genes 

which, in the naturally-occurring genome of the organism from which the DNA 
of the invention is derived, flank the gene. The term therefore includes, for 
example, a recombinant DNA which is incorporated into a vector; into an 
autonomously replicating plasmid or virus; or into the genomic DNA of a 
10 prokaryote or eukaryote; or which exists as a separate molecule (e.g., a cDNA 
or a genomic or cDNA fragment produced by PGR or restriction endonuclease 
digestion) independent of other sequences. It also includes a recombinant DNA 
which is part of a hybrid gene encoding additional polypeptide sequence. 

By "transformed cell" is meant a cell into which (or into an ancestor 
15 of which) has been mtroduced, by means of recombinant DNA techniques, a 
DNA molecule encoding (as used herein) a CAR receptor polypeptide. 

By "positioned for expression" is meant that the DNA molecule is 
positioned adjacent to a DNA sequence which directs transcription and 
translation of the sequence (i.e., facilitates the production of, e.g., CAR 
20 receptor polypeptide). 

By "a host cell which is functionally deficient for CAR receptor" is 
meant a cell, e.g., a mammalian cell, which exhibits little or no CAR receptor- 
mediated gene stimulatory activity; such activity may be measured in standard 
transactivation assays using, e.g., a transfected reporter gene operably linked to 
25 a CAR binding site as described herein. 

By "substantially pure antibody" is meant antibody which is at least 
60%, by weight, free from the proteins and namrally-occurring organic 
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molecules with which it is naturally associated. Preferably, the preparation is at 
least 75%, more preferably at least 90%. and most preferably at least 99%, by 
weight, antibody, e.g., CAR receptor-specific antibody. A substantially pure 
CAR receptor antibody may be obtained, for example, by affinity 
5 chromatography using recombinantly-produced CAR receptor polypeptide and 
standard techniques. 

By "specifically binds", as used herein, is meant an antibody which 
recognizes and binds CAR receptor polypeptide but which does not 
substantially recognize and bind other molecules in a sample, e.g., a biological 
10 <;?!Tnnlp whirh n^itiirnllv inrliirlf^Q PAP Tf^nf»ryfr\r r\r\]\rry^T\tiA^ 

~ - J 7 " ■ ' — ' - ' J — • — — — X. ^ ^ V w 1^ j-f jr vj. w . 

By "reporter gene" is meant a gene whose expression may be 
assayed; such genes include, without limitation, chloramphenicol transacetylase 
(CAT) and P-galactosidase. 

By "operably linked" is meant that a gene and a regulatory 
15 sequence(s) are connected in such a way as to permit gene expression when the 
appropriate molecules (e.g., transcriptional activator proteins) are bound to the 
regulatory sequence(s). 

By "heterologous" is meant any protein other than the human CAR 
receptor which includes a suitable (i.e., a DNA binding, gene activation, and/or 
20 ligand binding) domain. 

Other features and advantages of the invention will be apparent from 
the following description of the preferred embodiments thereof, and from the 
claims. 
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Detailed Deqrrip tion 
The drawings will first briefly be described. 

Drawings 

Fig. 1 IS the nucleic acid sequence (SEQ ID NO. l) and deduced 
amino acid sequence (SEQ ID NO: 10) of the human CAR receptor. 

Fig. 2 is the nucleic acid (SEQ ID NO: 11) and deduced amino acid 
sequence (SEQ ID NO: 12) of the murine CAR receptor mCARl . 

Figure 3 A is a schematic representation of the sequences of mCARl 
and mCAR/'MB67. The sequence of mCARl (Genbank accession number 
AF009327) is shown, and differences between mCARl and the previously 
described MB67 (referred to here as hCAR) and also the mCAR2 variant 
(Genbank accession number AF900328) are indicated. The DNA binding 
domain is in bold. Dots indicate residues not present in hCAR. The positions 
of the introns in the mCAR gene are indicated. The first intron is 5 nucleotides 
upstream of the ATG encoding the first methionine. Introns that fall within a 
codon are indicated after the corresponding amino acid. 

Figure 3B is a schematic representation indicating comparisons of 
mCARl to related members of the nuclear hormone receptor superfamily. The 
DNA binding, hinge, and Hgand binding/dimenzation domains of mCARl are 
indicated, and the percent identity of the analogous domains of related proteins 
is indicated. VDR is the human vitamin D receptor, xONR is a Xenopus laevis 
orphan receptor (Smith et al., Nucl. Acids Res. 22:.^6-l\, 1994), and EcR is the 
Drosophila melanogaster ecdysone receptor. 

Figures 4A and 43 are photographs demonstrating expression of 
mCARl and mCAR2 in liver mRNA. mRNA based PCR was used to amplify 
mCAR sequences using primers flanking the mCAR2 deletion, with mouse 
liver mRNA as a template. In Figure 4A, the positions of amplified bands of 
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the sizes expected for mCARl and mC\R2 are indicated. The products of the 
mRNA based PCR reaction are in the left lane. Size markers are m the right 
lane. In Figure 4B, an expenment is shown in which the gel in Figure 4A was 
blotted and hybndized with probes corresponding to the intact mCAR ligand 
5 binding domain or just exon 8, as indicated. Control lanes showed no bands by 
ethidium staining or hybridization. 

Figure 5A is a diagram indicating the positions of the mCAR introns, 
5' untranslated and 3' untranslated regions of the primar}' transcnpt, and the 
positions of various portions of the transcnpt and protein. For the 3' extended 
10 transcnpts, the 3' untranslated region extends into the 3' fianking region The 
8th exon deleted in mCAR2 is indicated by a star. 

Figure 5B is a photograph showing a primer extension analysis of 
mCAR transcripts- Either control RNA (lane 1 ) or total mouse liver RNA (lane 
2) was used as a template for primer extension using a 5' end labeled mCAR 
1 5 primer from the 5' untranslated region. Products were resolved next to a 
sequence ladder generated using the same primer. The arrow indir^tes 
specifically extended product. To the right of the photograph is indicated the 
position of the primary transcriptional start site relative to the mCAR genomic 
sequence (Genbank accession number AF009326). 
20 Figure 6 is a photograph and diagram indicating the expression of 

mCAR transcripts. A Northern blot containing mouse polyA^ mRNA from 
vanous tissues (Clontech) was sequentially hybridized with either a full length 
mCAR probe ("A") or a series of shorter probes as indicated. Probes "B" and 
"C" were generated by PCR using the mCAR cDNA as a template, and probe 
2^ "D" was generated by PCR using the genomic clone as a template. Only the 
results with liver mRNA are shown for the smaller probes. In much longer 
exposures of hybridizations with either the full length probe or the two probes 



from the 3' end. an additional transcript of approximately 2.8 kb was observed 
in heart, brain, skeletal muscle, and kidney. 

Figure 7 is a photograph showing DNA bmding by mCARl and 
mCAR2. mCARl , mCAR2, RXR, and RAR proteins were expressed by in 
vitro translation and used for electrophoretic mobility shift assays with a 
PRARE probe as indicated. "S" indicates competition with the pRARE 
oligonucleotide, and "NS" competition with a nonspecific oligonucleotide. 
Equivalent amounts of mCARl, mCAR2. and RAR were used in the binding 
reactions, as determined by ^^S-methionine labeling. 

Figures 8A and 8B are graphs indicating transactivation by mCARl 
and mCAR2. In Figure 8 A, a luciferase reporter containing three copies of the 
PRARE upstream of the TK promoter was cotransfected into HepG2 cells with 
the indicated receptor expression vectors and a TKGH control plasmid. 
Normalized luciferase activity is expressed by comparison to the activity 
observed with the CDM8 vector alone. In Figure 8B, two reporter constructs 
containing 336 bp of the RARp2 promoter driving expression of luciferase 
(Moghal et al., Mol. Cell. Biol. 15:3945-3959 1995) were cotransfected into 
HepG2 cells with control, mCARl, or mCAR2 expression vectors, as indicated, 
along with the TKGH internal control. In the -336 mut reporter, the pRARE 
was inactivated by a cluster of point mutants (Moghal et a]., Mol. Cell. Biol. 
15:3945-3959 1995). Normalized luciferase expression is presented. 

Figure 9A is a diagram showing the sequence of the AF-2 region of 
mCARl. The sequence that matches the AF-2 consensus of a glutamate 
flanked by paired hydrophobic residues (Danielian et al., EMBO J. 11:1025- 
1033, 1992) is in bold, and a sequence that corresponds to the 9th heptad in TR 
and other receptors (Forman et al., Mol. Endocrinol. 4:1293-1302. 1990) is 
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underlined. Mutations introduced into the mC\Rl sequence are indicated; 
dashes represent residues present in the mutant proteins. 

Figure 9B is a photograph showing transactivation by mCARl and 
AF-2 mutants. HepG2 cells were cotransfected with either the CDM8 vector 
5 alone or the vectors expressing the indicated CAR proteins, the PRARE-TKluc 
reporter, and a TKGH internal control. Expression relative to the wild type 
mCARl is indicated. 

Figure 9C is a graph indicating DNA binding by mCAR mutants. 
WHiole cell extracts from COSM6 cells transfected with expression vectors for 
1 0 the indicated mCAR proteins were incubated w^ith analo.sous extracts from 

RXR transfected celL and the pRARE oligonucleotide, and specific complexes 
were resolved by electrophoretic mobility shift. 



There now follows a description of the cloning and characterization 
of human and murine CAR receptor-encodmg cDNAs useful in the invention. 

15 This examxple is provided for the purpose of illustrating the invention, and 
should not be construed as limiting. 
Isolation of the Human CAR Receptor 

A human liver cDNA library was screened by standard techniques 
with the following degenerate oligonucleotide probe: TG C/T GAG GGI TG 

20 C/T .AAG G G/C ITT C/T TT C/T A/C G (SEQ ID NO. 2). This probe was 
based on the sequence of the P box of the thyroid/retinoid receptor subgroup, 
the most highly conserved portion of the DNA binding domain. As expected, a 
number of clones encoding previously descnbed members of the nuclear 
receptor superfamily were isolated. Based on limited sequence analysis (by 

25 standard techniques), one clone that did not correspond to any previously 
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reported cDNA was chosen for funher analysis. The complete sequence of this 
cDNA is presented in Fig. 1 (SEQ ID NO: 1). 

As indicated in Fig. 1, this cDNA encodes a protein of 348 amino 
acids that contains conser\'ed features of the nuclear receptor superfamily in 
both the DNA binding (C) domain and the putative ligand binding/dimerization 
(E) domain. Because of the activities of this protein described below, it is 
called CAR (Constimtive Activator of Retinoic acid response elements). CAR 
is one of the smallest superfamily members, with the shortest known A/B 
domain. 

As expected from the oligonucleotide used for screening, the 
sequence of the P box DNA binding specificity determining region of the first 
zinc finger placed CAR in the thyroid/retinoid group. CAR was not strongly 
related to any other superfamily member, but was most similar to the vitamin D 
receptor, sharing 42 identical amino acids out of 66 in the C domain (64%), and 
61/152 in the E domain (40%). This is quite similar to the relationship between 
the thyroid hormone and retinoic acid receptors: TRp and RARp which share 
60% and 40% sequence identity in the C and E domains, respectively. By 
comparison, the closely related human TRa and TRp receptors share 90% and 
85% identity in the C and E domain. CAR also shows significant similarity to 
the Drosophila melanogaster ecdysone receptor (62% in the C domain, 29Vo in 
the E domain), making it the third member of a divergent subgroup in the 
superfamily. Southern blot analysis (by standard techniques) indicated that 
CAR may not be a member of a closely related subgroup like the TRs and the 
RARs. 

Like a number of other members of the superfamily, CAR has a 
relatively long 5' untranslated region that contains several AUGs upstream of 
the start of the open reading frame. The function of this region is unknown. 
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Northem blot analysis (using standard techniques) indicated that 
CAR mRNA is expressed in a variety of human tissues, but is most abundant in 
liver. Multiple RNA species were observ^ed, but the nature of these different 
species is not clear. Based on the multiple products expressed by genes 
encoding other superfamily members, it may well reflect alternative splicing or 
promoter utilization. CAR mRNA is also observed in a number of human cell 
lines, including HepG2 (hepatoma), JEG-3 (choriocarcinoma), and HeLa. 
Ligand-independent transcriptional activation bv the human CAR 
ligand^dimerization domain 

To study the function of the putative ligand binding domain of the 
orphan, a chimenc receptor consisting of ihe C-terminal D, E and b domains of 
CAR (i.e., approximately amino acids 76-172, 173-319, and 320-348 of the 
CAR sequence, respectively) fused to the N-termmal A/B and C domains of 
TRp was generated (termed TR/CAR), along with a control hybrid with the 
A/B and C domains of TR and the D, E and F domain of the glucocorticoid 
receptor (TR.'GR). Vectors expressing these chimeras or the intact TR were 
cotransfected into JEG3 cells (ATCC Accession No: HTB 36) with reporter 
plasmids containing T3REs (i.e., of sequence 

AA.\GGTAAGATCAGGGACGTGACCGCAG; SEQ ID NO: 3) or GREs 
(from the MMTV promoter as described in Chandler et al.. Cell 33:489, 1989) 
upstream of a reporter gene whose expression could be assayed. Transfections 
were carried out in the presence of serum treated with activated charcoal to 
remove T3 as well as other low molecular weight hydrophobic compounds that 
could act as CAR ligands. 

Results from such an analysis revealed that both TR and the TRy'GR 
chimera behaved as ligand-dependent transactivators, as expected. The 
relatively low but reproducible level of activation conferred by TR/GR is 
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consistent with a previous report, and is thought to be a consequence of the 
absence from this hybrid of transcriptional activation domains present in the 
Ay'B domain of the intact GR and the F domain of the intact TR. 

In contrast, the TR'CAR chimera activated expression of the T3RE 
containing reporter in the absence of any added Hgand. This effect was not 
altered by addition of T3, astraddle, testosterone, resinoid acid, or 
dexamethasone, or by the addition of the orphan receptor ligand 25-hydroxy 
cholesterol. The constitutive activation conferred by this hybrid was 
substantial, correspondmg to greater than 50% of the response of the intact 
TRY. Similar results were observed with an analogous GO/CAR chimera in 
cotransfections with an MMTV/CAT reporter. 

The activation conferred by the TR/CAR chimera in the presence of 
charcoal-stripped serum could be a consequence of either a direct, constitutive 
transcriptional activation function in the D, E or F domains of the orphan, or 
the presence of an uncharacterized ligand in the medium of the growing cells. 
To minimize the possibility of the presence of such a ligand in the media, the 
transfections were repeated in the absence of serum. Such conditions 
substantially reduced the level of both control and T3-activated expression, but 
did not prevent the constitutive activation function conferred by the TR/CAR 
chimera. Because of the substantial changes in the levels of expression from 
control promoters, the significance of the apparent decrease in constitutive 
activity relative to the level of activation conferred by TRY in the presence of 
hormone is uncertain. This apparent decrease may reflect the absence of some 
specific stimulator of CAR function present in serum or could be a less specific 
effect associated with the unfavorable growth conditions. 



w o 99/15555 



PCT.TS98/19365 



-17- 

Activation of resinoid acid response elements by human CAR 

Based on the constitutive activity of the TR/CAR chimera and the 
similanty of the DNA binding domam of CAR to other members of the 
superfamily, a number of response elements were screened for activation by the 
5 intact orphan in standard cotransfections. Despite the relatively close sequence 
relationship between CAR and the vitamin D receptor, no response was seen 
with the combined vitamin D/RA response element from the rat osteocalcin 
gene (i.e., of sequence TGGGTG.A.\TGAGGACATTACTGACCGCTCCG; 
SEQ ID NO: 4). However, CAR did transactivate two wild type elements, the 

GGGTAGGGTTCACCGAAAGTTCACTCG; SEQ ID NO: 5). RAREs which 
w^ere not transactivated by CAR included a potent up mutant version of the rat 
growth hormone gene T3 RE/RARE (i.e., of sequence 

AAAGGTAAGATCAGGGACGTGACCTCAG; SEQ ID NO:6; in tandem 
15 copies with the second copy inverted; described in Brent et aL, Mol. 
EndocrinoL 3: 1996, 1989) and the wild type version of the laminin 
T3 RE/RARE (i.e., of sequence 

AGACAGGTTGACCCTTTTTCTAAGGGCTTAACCTAGCTCACCTG; 
SEQ ID NO: 8). The rat mahc enzyme T3RE (i.e., of sequence 
20 AGGACGTTGGGGTTAGGGGAGGACAGTG; SEQ ID NO: 9) and the rat a- 
myosin heavy chain T3RE (i.e., of sequence 

CTGGAGGTGACAGGAGGACAGCAGCCCTGA: SEQ ID NO: 7), which do 
not respond to RARs, did not respond to CAR. 

The activation of the RARp element by CAR is not affected by 
25 several treatments that activate or inactivate signal transduction pathways 

mediated by protein kinases A or C. Thus, CAR activation was not altered by 
addition of dibutyryl cyclic AMP, by short or long term treatments with 
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phorbol esters, or by cotransfections with vectors expressing the specific 
protein kinase A inhibitor peptide. 

The activation of the R.AREs bv CAR 
play an important role in the complex regulatory network that controls 
expression of RA responsive genes. To examme the functional interactions 
between CAR and RARP, expression vectors for both were mixed in 
cotransfections. Increasing amounts of the CAR expression vector were added 
to a PRARE (SEQ ID NO: 5)-containing CAT reporter plasmid and a fixed 
amount of RARp expression vector corresponding to approximately 2/3 of the 
level that is samrating for induction. In the absence of RA, the increasing 
amount of CAR led to an increase in basal expression. In the presence of RA, 
the high level of activated expression was not strongly affected by addition of 
CAR. Together, these effects resulted in a significant decrease in the RA 
induction ratio with increasing amounts of CAR. 

These results indicate that the two superfamily members function 
independently at this response element. Analogous results were obtained when 
increasing amounts of RARp were added to a subsaturating amount of CAR 
expression vector. In the absence of R.A., the constitutive activation conferred 
by CAR could be blocked by excess RARp. This is consistent with previous 
reports that R.A.Rs can repress expression in the absence of ligand. When 
retinoic acid was added, activation was observed at even moderate levels of 
RARp. The levels of activated expression associated with the various doses of 
RARP vector were similar to those observed in the absence of cotransfected 
CAR. It therefore seems most likely that CAR and RARp do not interfere with 
each other when co-expressed at moderate levels. Under other circumstances, 
more complex indirect effects could be anticipated for the interaction between 
CAR and RXR (see below). 
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In contrast, addition of low levels of RXRa stimulated the effect of a 
subsaturating dose of CAR expression vector. Since similar results are 
obser\^ed with the receptors able to heterodimerize with RXRs, this result 
strongly suggests that RXRoc may share a similar interaction whh CAR, 
5 DNA binding bv human CAR 

To confirm that CAR binds the PRARE, nuclear extracts from HeLa 
cells infected with vaccinia virus vectors overexpressing FLAG epitope-tagged 
versions of CAR or R.\Ra were used in standard gel shift experiments. Both 
CAR- and R.ARa-containing extracts showed specific binding to the R.ARP 

i\) ciciiiciii \ iLJ J K Willi uiiiuiii^ uciiig ui iugiici apuciieiu aiiiiiiLV. 

In agreement with results of cotransfections, CAR binding was strongly 
stimulated by addition of nuclear extract from vaccinia infected HeLa cells 
overexpressing RXRa. However, little or no effect on affinit>' was observed 
when CAR and RAR were mixed. No evidence for formation of CAR/RAR 

1 5 heterodimers was observed in gels electrophoresed for longer times to bener 
resolve the CAR and RAR shifted comnlexes. From these results, it annears 

A --AX 

that CAR can bind directly to the RARP element, and that binding is strongly 
stimulated by RXRa. 

The constitutive activity of CAR may be due to either a truly 

20 constitutive transcriptional activation function or the presence of some 

ubiquitous ligand. The former possibility is supported by its activity in serum 
free media and m distinct cell types. The existence of a ligand may be favored 
by the somewhat lower relative activity observed with serum free medium 
compared to medium containing charcoal stripped serum. However, the 

25 substantial changes in the expression of the control and activated promoters 
under these two quite distinct growth conditions suggest that less specific or 
direct effects could explain this difference. 
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Negative results have been obtained by several approaches designed 
to determine whether the constitutive activation of CAR is associated with 
various second messenger pathways. Fvidence obtained to date indicates that 
CAR function is not affected by activation or repression of the activity of 
5 protein kinases A or C. 

Based on the results presented here, it is possible that CAR plays two 
important roles in the complex, interlocking set of proteins that determines 
responses to RA, T3 and vitamin D. The first is to maintain a basal level of 
expression of a subset of RA responsive genes in the absence of the ligand. In 
1 0 the case of a cell expressing only RARp, for example, this could allow 

expression of sufficient levels of the receptor to allow autoactivation of the RA- 
dependent positive feedback loop that regulates RARp expression upon 
addition of ligand. The second potential function, is based on the interaction of 
CAR with RXR. Increasing expression of CAR would be expected to decrease 
1 5 the amount of RXR available for interaction with other heterodimeric partners. 
Thus, in a cell with limiting amounts of RXR, alterations in the amount or 
activity of CAR protein could have significant effects on the activity of RARs, 
T3Rs or VDR. Although the levels of CAR used in the cotransfections 
reported here did not show an antagonistic effect on RAR activity, preliminary 
20 results indicate that inhibitory effects of this type can be observed in other 
circumstances. Given the remarkable complexity of the regulatory networks 
that control response to the retinoids and the other ligands of this subgroup of 
the nuclear receptor superfamily, it is likely that even more complicated 
functions will be found for CAR. 
25 Isolation of murine CAR receptors 

A number of mouse CAR cDNA clones were isolated by standard 
techniques from a liver cDNA library using an hCAR (MB67) probe. Of 9 
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independent clones examined in detail, 4 had an intact ligand binding domain, 
as judged by compansons to hCAR and other superfamily members. 3 had an 
internal deletion of 107 ammo acids. Representative sequences of these clones 
have been submmed to GenBank. One additional clone included 188 bp of 
5 intron derived sequences; it is unclear whether this clone corresponded to an 
authentic variant or a contaminating nuclear precursor. The mCAR2 sequence 
used for further experiments corresponded to the cDNA clone with the most 
extensive 5' untranslated sequences, and the internal deletion. To generate the 
mCARl sequence used, the deleted region in this mCAR2 clone was replaced 

10 with the corresponding fragment from a clone that did not carry the deletion. 

Characienzaiion of these clones indicated that mCARl and hCAR 
shared a high degree of sequence identity throughout the DNA and ligand 
binding/dimerization domains (Figs. 3A and 3B). Both proteins had ver>' short 
N-terminal regions. mCAR2 was identical to mCARI except for an 

15 out-of- frame 107 bp deletion in the ligand binding/dimerization domain. As a 
consequence of this deletion, only 6 new amino acids in mCAR2 substituted for 
the last 78 residues of mCARL This C-terminal region of mCARl included 
both the heterodimerization interface referred to as the 9th heptad (Forman et 
al., Mol. Endocrinol. 4:1293-1302, 1990, Forman et al., MoL Endocrinoi 

20 3:1610-1626, 1987) or the I box (Perlmann et aL, Moi Endocrinol. 10:958-966, 
1996), and the AF-2 transactivation domain (Danielian et al., EMBO J, 
11:1025-1033, 1992). hCAR and mCAR belong to a small, rather divergent 
subgroup within the nuclear hormone receptor superfamily that also includes an 
orphan receptor from Xenopus laevis (Smith et al., NucL Acids Res, 22:66-71, 

25 1994 ) and the mammalian vitamin D receptor. hCAR and mCAR also showed 
relatively strong similanty with the insect ecdysone receptor in the DNA 
binding domain, but not other domains (Fig. 3B). 



Two lines of evidence demonstrated that mCAR2 was not a cloning 
artifact. The first was simply that this 107 bp segment was missing m more 
than 5 independently isolated cDNAs. The second was a more direct 
demonstration of the existence of bands of the expected size using PGR with 
primers from the E domain and polyA^ mRNA from mouse liver as template. 
As shown in Figures 4A and 4B, the identity of these bands was confirmed by 
Southern blonmg with probes containing either ligand/dimerization domain 
sequences present in both, or only sequences from exon 8. This analysis 
indicated that the mCAR2 variant was a relatively minor fraction of the total 
mRNA. In addition to these two species, an even lesser amount of an 
additional species larger than the mCARl product was also observed. Its size 
indicated that it corresponded to a nuclear precursor that included intron 7, 
which interrupted the coding region at the start of the mCAR2 deletion. A 
cDNA clone including only this intron was isolated, but it was unclear whether 
it represented an additional variant mRNA or a partially spliced nuclear 
precursor. 

The similarity between the human hCAR and murine mCAR 
sequences was significantly less than the 90% or greater identity usually shared 
by true receptor homologs in different mammalian species. It was comparable 
to that shared by the various isoforms of the RARs, for example, suggesting 
that hCAR and mCAR were derived from distinct genes that encode two CAR 
isoforms. 

Structure of the murine CAR gene 

A single clone containing the mCAR gene was obtained from a 
screen of a mouse genomic library. A NotI fragment containing the entire 
mCAR gene and 5' and 3' flanking regions was subcloned and sequenced in its 
entirety. This sequence has been submitted to GenBank. As diagrammed in 
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Figure 5A, the murine mCAR gene was interrupted by eight introns. Exon 8 
was absent in mCAR2, demonstrating that this vanant was generated by 
alternative mRNA splicing. The 5' end of the mCAR transcnpt w^as mapped by 
primer extension (Fig. 5B). 

5 Expression of the murine CAR gene 

To carry out an analysis of murine CAR expression, a Northern blot 
containing 2ug of polyA^ mRNA from a variety of tissues was obtained from 
Clontech, Inc. and hybridized sequentially with either a full length mCAR 
probe or smaller probes generated by PCP^. The N-termma! probe consisting 
10 of 167 bp from the 5' untranslated region, and the C-terminal probe, consisting 
of the last 200 bp of the mCARl coding region, were generated by PCR using 
the mCARl cDNA as a template. (5' untranslated region primers: 5' 
TTCCTACCTACATATGGC 3^ (SEQ ID NO: 13) and 5' 

GACCCTGCTTTCCTTGAGAT 3' (SEQ ID NO: 14); C-terminal pnmers: 
1 ^ s'Ar^TmATrr'Trr APTTrr AT v tt^ xTrk. i s\ r^^A 

5' ACTGCAAATCTCCCCGAGCAGCG 3' (SEQ ID NO: 16)). A 210 bp 3* 
flanking probe extending from a position 130 bp downstream of the polyA 
addition site was generated by PCR using the genomic mCAR clone as 
template. (Primers: 5' TAGGAGGTRGACTAGAGTTCCTTCT 3^ (SEQ ID 
20 NO: 17) and 

5'GATTGAGATATTACTACTCCTTTCTTC3'(SEQIDNO: 18)). Prior to 
each rehybridization, the membrane was exposed for at least 2 days to confirm 
the removal of the previous probe. 

Standard conditions ( Ausube! et al.. Current Protocols in Molecular 
25 Biology. Greene Pub. Assoc.: New York, 1997) were used for mRNA based 
PCR with mouse liver mRNA as a template and pnmers from exons 7 and 9. 



For Southern analysis of the PCR products, probes corresponded to a restriction 
fragment containing the mCARl Hgand binding domain, or a shorter segment 
derived solely from exon 8. Standard condition*; ( Aj.isubel et al.. Current 
Protocols in Molecular Biology^ Greene Pub. Assoc.: New York, 1997) were 
used for primer extension analysis with total mouse liver RNA, and the primer 
5' TGCCTCAGTGCCTGGAAAACAAGGGCCTTCTCTTGC 3' (SEQ ID NO: 
19) derived from the mCAR 5' untranslated region. 

Results from these expenments are shown in Figure 6. In Northern 
blots of polyA^ RNA from vanous mouse tissues hybndized at high stringency 
with CAR probes, mCAR expression was by far the highest in liver, as 
previously described for hCAR. In both mouse and human, the most prominent 
product was a rather broad band of approximately 1 .3 to 1 .7 kb At least in 
mouse, this band presumably included both mCARl and mCAR2 transcripts. 
Additional species of approximately 3.0, 4.0, and 5.7 kb were observed in 
mouse. Larger transcnpts of somewhat different sizes were also present in 
human liver mRNA. All of these larger mouse transcripts were also identified 
by much smaller probes containing only 5* untranslated or ligand/dimerization 
domain sequences (Fig. 6). The hybridization of the 5' probe indicated that 
these various species did not correspond to variants w^ith distinct N-terminal 
sequences, as obsen^ed, for example, with TRpl and TRP2 (Hodm et al., 
Science 244:76-99, 1989). Similady, the identical pattern of hybridization with 
the ligand/dimenzation probe indicated that the various transcnpts did not 
include major substitutions of sequences in that region. Finally, a probe from 
the 3' flanking region of the gene, hybridized only to the three larger species. 
Thus, these larger species were presumably generated as a consequence of a 
lack of addition of a polyA tract at the position used for the shorter transcripts. 
These longer, read-through transcripts must include extensive additional 3' 
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untranslated regions, but it is not known whether the different larger species are 
generated by additional cases of alternative poly A addition, or by alternative 
mRNA splicing. 

In much longer exposures of hybridizations with either the full length 
5 probe or the two probes from the 3' end, an additional transcript of 

approximately 2.8 kb was observed in heart, brain, skeletal muscle, and kidney. 
This did not correspond in size to any of the liver transcripts, and could have 
been either a transcript of the true murine homolog of hCAR, or another variant 
product of the mCAR gene. 
1 v; tJiN^ nmciing ny mv i anci mv m k / 

For DNA binding studies, mCARl, mCAR2, RAR, and RXR 
proteins were expressed using coupled in vitro transcription and translation 
(Promega, Inc.). Standard conditions were used for electrophoretic mobility- 
shift assays with the DR-5 response element from the RARp2 promoter (the 
15 pRAR£) as the probe, as described for hCAR (MB67). In these expenments, 

mCAR 1/T^XR hptPrnHimer*; hminH with ViiaVi ^ffiniUr fn thp^ t? A T? P 

RARP2 isoform promoter (Sucov et al., Proc. Natl. Acad. Sci. USA 87:5392- 
5396, 1990: de The et al.. Nature 343:177-180. 1990) (Fig. 7). As with hCAR, 
the apparent affinity of mCARl/RXR heterodimers for this PRARE element 

20 was indistinguishable from that of RAR/RXR heterodimers. mCAR2 did not 
bind the PRARE or any other element tested either alone or with RXR. This 
lack of mCAR2 binding was consistent with the absence of the C-terminal 
portion of the ligand/dimerization domain, which is essential for 
heterodimerization. 

25 Transactivation by mCARl and m.(:AR2 

To determine whether mCARl transactivated the pRARE in the 
absence of retinoids or any other exogenously added ligands, this construct was 
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cotransfected with a luciferase reponer plasmid in which 3 copies of the 
PRARE were inserted upstream of the TK promoter. To carry out these 
experiments. HepG2 cells were maintamed and transfected using calcium 
phosphate or DEAE dextran (Ausubel et al.. Current Protocols in Molecular 
5 Biology, Greene Pub. Assoc.: New York, 1997). mCARl and mCAR2 

expression vectors were generated by inserting appropriate fragments into the 
vector CDM8 (Seed, Nature 329:840-842. 1987), as previously descnbed for 
hCAR. The luciferase reporter plasmid (Gulick et a!., Proc. Natl. Acad. Sci. 
USA 91:11012-11016. 1994) contamed three copies of the PRARE upstream of 

10 the TK promoter. The reporter plasmids with wild type and mutant versions of 
the RARP2 promoter have been described (Moghal et al., Mol. Cell. Biol. 
11:3945-3959, 1995). Transfections also included the pTKGH control plasmid 
which directed expression of human growth hormone (Selden et al., Mol. Cell. 
Biol. 6:3173-3179, 1986). Luciferase activities were normalized using levels of 

15 growth hormone expression. 

For the TK reporter, pairs of 30 mm dishes were each transfected 
with 2\ig of reporter and pTKGH, and a total of 2^g of mammalian expression 
vector. For the RARP2 reporters, pairs of 30 mm dishes were each transfected 
with 1 .5^g of the reporter and of the mammalian expression vector, and 2|ig of 

20 the pTKGH control. Transfections were carried out in media containing 10% 
charcoal stripped serum, and luciferase activity was determined using the 
Promega luciferase assay system as described by the manufacturer. Luciferase 
values were normalized using the growth hormone expression directed by the 
TKGH internal control. 

The results of these experiments are shown in Figures 8 A and 8B. In 
various cell types and under a variety of conditions, hCAR transactivated this 
reporter approximately 20-100 fold, while mCARl was somewhat more 
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effective, confemng a 50 to 300 fold activation (Fig. 8A). In the presence of 
all-trans retinoic acid, RAR was an even more potent transactivator of this 
reporter. As with hCAR, the apparently constitutive transactivation by mCARl 
was obser\^ed in the presence of serum treated with charcoal to remove 
5 retinoids or other potential ligands, and in several different cell types. As 
shown in Figure 8A, mCAR2 did not transactivate this element, and also did 
not affect transactivation by hCAR or mCARl . Similar results were obtained 
with higher ratios of mCAR2, As expected from these results, mCARl, but not 
mCAR2, was able to transactivate a reporter containing the intact RARp2 

1 O rtrr»mrttp»r Knt not r» mntri-nt ^.Tf^r'c^r\r^ in xTrVi-I-^U -fVi^ n.T> A T> 

inactivated by point mutations (Fig. SB). 

To confirm that the transactivation observed was a direct effect of 
mCARL mutations were introduced into the mCARl AF-2 region. These 
mutations included both simple deletions and point mutants that were chosen 

15 based on comparisons of this region of mCAR to the analogous region in other 
receptors. As indicated in Figures 9 A, the D8 mutation and the point mutants 
L353A and E355A specifically affected the AF-2 motif. The larger D27 
mutation extended into the region homologous to helix 10 of RXR (Bourget et 
al., Nature 325:377-382, 1995), and the analogous helices in the TR (Wagner et 

20 al.. Nature 378:690-697, 1995 ) and RAR (Renaud et al.. Nature 378:681-689, 
1995) structures, but stopped short of the 9th heptad motif (Forman et al., MoL 
Endocrinol 4:1293-1302, 1990 ) within this helix. All of these mutations 
completely abrogated transactivation by mCARl (Fig. 9B). A Westem blot 
with an antibody directed against an epitope tag present at the N-terminus of 

25 the wild type and mutant proteins confirmed that all were expressed at similar 
levels. To confirm expression of mutant derivatives of mCARl, whole cell 
extracts were prepared from appropriately transfected COSM6 cells as 
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described (Ladias et al. J. Biol Chem. 267:15849-15860, 1992). COSM6 cells 
do not express endogenous CAR proteins. These extracts were supplemented 
similarly prepared COSM6 expressed RXR and used for electrophoretic 
mobility shift analysis usmg standard conditions. 
5 As demonstrated in Fig. 9C, all the mutants except D27 were able to 

bind DNA. Although this mutant retained the 9th heptad motif, it was missing 
most of the helix that contained this motif At least in the case of RXR 
homodimers, the C-terminus of this helix is involved in an important dimer 
contact (Bourget et al., Nature 375 :377-382, 1995), 

10 mCAR Isoforms 

Many of the genes that encode the members of the nuclear hormone 
receptor superfamily express more than one protein product as a consequence 
of either alternative promoter utilization or alternative mRNA processing. The 
independent isolation of a number of cDNA clones corresponding to the 

1 5 deleted mCAR2 product initially suggested that the mCAR gene belonged to 
this group, and the existence of distinct mCAR isoforms was confirmed by a 
PCR based approach. The mCAR gene also expressed additional transcripts 
with larger 3' untranslated sequences as a consequence of a lack of poly A 
addition at the primary site. However, these larger transcripts did not encode 

20 variant mCAR proteins. 

The mCARl and hCAR sequences contained a good match to the 
AF-2 transcriptional activation domain at their extreme C-termini. This 
conserv^ed motif is present in many conventional receptors and orphans 
(DanieHan et al., EMBOJ. 11:1025-1033, 1992), and has been directly 

25 associated with ligand dependent transcnptional activation in several 

conventional receptors (e.g., (Barettino et al., EMBOJ. 13:3039-3049, 1994; 
Tone et al., J, Biol. Chem, 269:3 1 1 57-3 1 16L 1994, Tate et al., Mai Cell Biol. 



3NSDOC:0' <WC 9915555A' . _> 



w o 99/15555 



PCT/TJS98/19365 



-29- 

14:2323-2330, 1994; and vom Baur et al., £.1/5(9 7. 11:1 10-124, 1996)). 
Mutation of this conserved motif in mCAR also blocked transactivation. This 
demonstrated that the CAR component of the CAR'RXR heterodimer was 
actively involved in transactivation. It also confirmed a prediction, derived 
5 from both sequence conservation and recent results with HNF-4 (Malik et al., 
MoL Cell. Biol 16:1824-1831, 1996), that the conserved AF-2 motif is 
involved not only in ligand dependent transactivation, but also in the apparently 
ligand independent transactivation observ^ed with several other orphans. 

The mCAR2 variant was missing an additional conserved motif near 
1 0 the C-terminus of tb.e ligand binding/dimerization domain, which has been 
called the 9th hepiad (Forman et ah, MoL Endocrinol. 4:1293-1302, 1990; 
Forman et al., Mol Endocrinol. 3: 1610-1626, 1987) or the I-box (Perlmann et 
al., Moi Endocrinol 10:958-966, 1996), A number of studies with diverse 
receptors have shown that this region is required for heterodimenzation with 
15 RXR (e.g.. Au-Fliegner et aL, Mol Cell Biol 13:5725-5737, 1993), Since 
heterodimenzation is required for high affinity DNA binding, it is not 
surpnsmg that the mCAR2 variant did not bind the pRARE or other elements. 
Particularly since mCAR2 was also missing the conserved AF-2 transactivation 
motif, it should not compete for coactivator binding, and should not inhibit 
20 transactivation by mCARl , a prediction borne out in appropriate transfections. 
This leaves the function of the mCAR2 variant undefined, though it is possible 
that it could compete with the mCARl form for interaction with other, as yet 
undefined, proteins. 

Previous results have demonstrated a positive feedback loop in 
w^hich the R.^\Rp2 transcript is strongly induced by retinoids (Sucov et al., 
Proc, Natl Acad. Set USA 87:5392-5396, 1990; de The et al.. Nature 343:177- 
1 80, 1990). Although the levels of the RARP2 transcnpt are very low in the 
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absence of retinoids in some cultured cell lines, this transcnpt is present at 
significant levels in the livers of vitamin A deficient animals (Haq et al., Proc. 
Natl. Acad. Sci. USA 88:8272-8276, 1991). The combination of the expression 
of mCARl in the liver and its ability to transactivate the R.^Rp2 promoter in 
the absence of any added retinoids clearly suggests a direct role for this orphan 
in maintaining basal levels of RARP2. From a broader perspective, it seems 
likely that mCARl also acts to maintain expression of other retinoid responsive 
genes in the absence of retinoids. 

As previously discussed for hCAR, mCARl could have more 
complex effects in the presence of retinoids. Since it is a less potent 
transactivator than RAR, it could act to decrease overall expression by 
competing for occupancy if both proteins were present at high levels. If both 
CAR and RAR were present at subsaturating levels, however, additional 
occupancy of RAREs by CAR might augment RAR action. The apparent 
ability of endogenous RARs to ftilly occupy specific R.\R£s under at least 
some in vivo conditions (Dey et al., Mol. Cell Biol. 14:8191, 1994) suggests 
that competition for binding could be an important aspect of CAR function. In 
contrast, the observation that supraphysiologic expression of a receptor usually 
substantially augments response (e.g.. Brent et al., J. Biol. Chem. 264:178, 
1989; Webb et al.. Mol. Endocrinol. 6:157, 1992) is consistent with the 
possibility that CAR cooperates with the RARs. 

The relative roles of CAR and RAR on a particular element are 
clearly dependent, not only on their relative levels of expression, but also on 
their relative DNA binding affinities. mCARI/RXR heterodimers, like 
hCAR/RXR heterodimers, bound the pRARE with an affinity indistinguishable 
from that of RAR'RXR heterodimers, and CAR/RXR complexes can also bind 
with high affinity to other DR-5 and DR-2 RAREs. Unexpectedly, however. 
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recent results have demonstrated that the affinity of CAR RXR heterodimers, 
but not RAR/RXR heterodimers, for DR-5 sites is strikingly dependent on the 
sequence of the 5 base pair spacer. This suggests that CAR targets only a 
subset of retinoid responsive genes, and provides yet another example of the 
5 complexity of the overlapping network of regulatory effects of the nuclear 
hormone receptors. 
Poh^^eptide Expression 

Polypeptides according to the invention may be produced by 
transformation of a suitable host cell with all or part of a CAR receptor- 

10 encoding cDNA fragment (e.g , the cDNA. described above) in a suitable 
expression vehicle. 

Those skilled in the field of molecular biology will understand that 
any of a wide variety of expression systems may be used to provide the 
recombinant receptor protein. The precise host cell used is not critical to the 

1 5 invention. The CAR receptor may be produced in a prokaryotic host (e.g., E. 
coli) or in a eukary-otic host (e.g., Saccharomyces cerevisiae or mammalian 
ceils, e.g., COS 1, NIH 3T3, and JEG3 cells). Such cells are available from a 
wide range of sources (e.g., the American Type Culture Collection, Rockland, 
MD; also, see, e.g., Ausubel et aL, Current Protocols in Molecular Biology, 

20 John Wiley & Sons, New York, 1989). The method of transfection and the 
choice of expression vehicle will depend on the host system selected. 
Transformation and transfection methods are described, e.g., in Ausubel et al. 
{Current Protocols in Molecular Biology^, John Wiley & Sons, New York, 
1989); expression vehicles may be chosen from those provided, e.g., in Cloning 

25 Vectors: A Laborator>' Manual (P.H, Pouwels et al., 1985, Supp. 1987). 

One preferred expression system is the mouse 3T3 fibroblast host 
cell transfected with a pMAMneo expression vector (Clontech, Palo Alto, CA). 



pMAMneo provides: an RSV-LTR enhancer linked to a dexamethasone- 
mducible MMTV-LTR promotor, an SV40 ongin of replication which allows 
replication in mammalian systems, a selectable neomycin gene, and SV40 
splicing and polyadenylation sites. DNA encoding a CAR receptor polypeptide 
would be inserted into the pMAMneo vector in an orientation designed to allow 
expression. The recombinant receptor protein would be isolated as descnbed 
below. Other preferable host cells which may be used in conjunction with the 
pMAMneo expression vehicle include COS cells and CHO cells (ATCC 
Accession Nos. CRL 1650 and CCL 61, respectively). 

Alternatively, a CAR receptor polypeptide is produced by a stably- 
transfected mammalian cell line. 

A number of vectors suitable for stable transfection of mammalian 
cells are available to the public, e.g., see Pouwels et aL (supra); methods for 
constructing such cell lines are also publicly available, e.g., in Ausubel et al. 
(supra). In one example, cDNA encoding the receptor polypeptide is cloned 
into an expression vector which includes the dihydrofolate reductase (DHFR) 
gene. Integration of the plasmid and, therefore, the CAR receptor-encoding 
gene into the host cell chromosome is selected for by inclusion of 0.01-300 
methotrexate m the cell culture medium (as described in Ausubel et al., supra ). 
This dominant selection can be accomplished in most cell types. Recombinant 
protein expression can be increased by DHFR-mediated amplification of the 
transfected gene. Methods for selecting cell lines bearing gene amplifications 
are described in Ausubel et al. ( supra ): such methods generally involve 
extended culture in medium containing gradually increasing levels of 
methotrexate. DHFR-containing expression vectors commonly used for this 
purpose include pCVSEII-DHRF and pAdD26SV(A) (described in Ausubel et 
al., supra). Any of the host cells described above or, preferably, a DHFR- 
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deficient CHO cell line (e.g., CHO DHFR cells, ATCC Accession No. CRL 
9096) are among the host cells preferred for DHFR selection of a stably- 
transfected cell line or DHFR-mediated gene amplification. 

Once the recombinant CAR receptor polypeptide is expressed, it is 
isolated, e.g., using affinity chromatography. In one example, a CAR binding 
site (e.g., the pRARE site described above) or an anti-CAR receptor antibody 
( e.g., produced as described below) may be attached to a column and used to 
isolate the receptor polypeptide. Lysis and fractionation of receptor-harboring 
cells prior to affinity chromatography may be performed by standard methods 

( p a AiT<;nhel pt rtl c;nr>rn^ C^rtnf^ icr-»lc»+/ar? tVt^ ^ar^^^U^^^-r^* :-r 

desired, be further purified, e.g., by high performance liquid chromatography 
(see, e.g.. Fisher, Laboratoiy Techniques In Biochemistry^ And Molecular 
Biology, eds.. Work and Burdon, Elsevier, 1980). 

Receptors of the invention, particularly short receptor fragments, can 
also be produced by chemical synthesis (e.g., by the methods described in 
Solid Phase Peptide Synthesis, 2nd ed., 1984 The Pierce Chemical Co., 
Rockford, IL). 

These general techniques of polypeptide expression and purification 
can also be used to produce and isolate useftil CAR receptor fragments or 
analogs (described below). 

Identifica tion of Ligands which Bind CAR Receptors 

Although the human CAR receptor described above was capable of 
activating some level of target gene expression in the apparent absence of a 
Iigand, this does not discount the possibility that the receptor interacts with one 
or more ligands, e.g., to modulate receptor activity. Moreover, it is possible 
that multiple CAR receptors exist (e.g., as the products of a differenfially- 
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spliced CAR mRNA) and that different receptor species interact with different 
hgands. Accordingly, one aspect of the invention features a screening assay for 
the identification of compounds which specifically bind to the CAR receptors 
described herein. Such an assay may be earned out using a recombinant 
5 receptor protein. 

In one example, the CAR receptor component is produced by a cell 
that naturally produces substantially no receptor or by a cell which produces 
functionally deficient receptor (i.e., a cell which apparently expresses CAR 
receptor mRNA as measured by Northern blotting but which does exhibit 

10 reporter gene induction, in the absence of recombinantly-produced CAR 

receptor, in a transactivation assay, see below); suitable cells are, e.g., those 
discussed above with respect to the production of recombinant receptor, most 
preferably, mammalian cells such as JEG3 cells. Host cells are transfected with 
(1) a vector which expresses a nucleic acid encoding the CAR receptor 

1 5 component (i.e., the "producer vector") and (2) a vector which includes a CAR 
receptor binding site (e.g., the pRARE sequence 

GGGTAGGGTTCACCGAAAGTTCACTCG; SEQ ID NO: 5; described 
above) positioned upstream of a target gene which may be assayed (e.g., a CAT 
gene or a P-galactosidase gene) (i.e., the "reporter vector"). Using such a 

20 standard transactivation assay procedure, CAR receptor activity is assayed by 
measuring CAR binding site-dependent target gene expression. CAR hgands 
are identified as those compounds which, when added to the host cell medium, 
effect a change in CAR receptor-directed gene expression (as detected using 
any CAR reporter vector); a CAR ligand according to the invention may either 

25 increase CAR receptor activity or decrease CAR receptor activity. 

Any suitable transactivation technique, CAR receptor-encoding 
producer vector, and CAR receptor binding site-containing reporter vector may 
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be used. Descnptions of transactivation assays and generally useful vectors for 
the identification of ligands which bind other nuclear hormone receptors are 
described, e.g., in Evans et al. (U.S. Pat. No. 4,98 1J84, 1991); Evans et al. 
(WO 90/07517); Evans et al. (WO90/01428); and WO88/03168; all hereby 
5 incorporated by reference. CAR receptor polypeptides which may be used to 
screen for CAR ligands include wild-type molecules as well as any appropriate 
chimeric receptor, for example, the GR/CAR and TR^'CAR receptors described 
above. 

Candidate ligands may be purified (or substantially puritled) 

J wii*^ w»j*ii|^vyxA*^x>i. ixix^vkV^l \Ji. il^U.liV_10 ail 

cAlrauL or supemaiani obtained from ceils; Ausubei et al., supra ). In a mixed 
ligand assay, the CAR ligand is identified by testing progressively smaller 
subsets of the ligand pool (e.g., produced by standard purification techniques, 
e.g., HPLC or FPLC) until a single ligand is finally demonstrated to modulate 

15 the CAR receptor gene stimulatory activity. Candidate CAR ligands include 
peptide as well as non-peptide molecules. 

Alternatively, a ligand may be identified by its ability to bind a CAR 
receptor polypeptide using affinity chromatography. Recombinant receptor is 
purified by standard techniques, from cells engineered to express the receptor 

20 (e.g., those described above); the recombinant receptor immobilized on a 
column (e.g., a Sepharose column or a streptavidin-agarose column by the 
immunoaffinity method of Ausubei et al., supra ) and a solution containing one 
or more candidate ligands is passed through the column. Such a solution (i.e., 
such a source of candidate ligands) may be, e.g., a cell extract, mammalian 

25 serumi, or growth medium on which mammalian cells have been culmred and 
into which the cells have secreted factors (e.g., growth factors) during culture; 
again, candidate CAR ligands include peptide as well as non-peptide 



molecules. A ligand specific for a recombinant receptor is immobilized on the 
column (because of its interaction with the receptor). To isolate the ligand. the 
column is first washed to remove non-specifically bound molecules, and the 
ligand of interest is then released from the column and collected. 

CAR ligands isolated by the above methods (or any other appropriate 
method) may, if desired, be ftirther purified (e.g., by high performance liquid 
chromatography; see above). Once isolated in sufficiently-purified form, a 
novel peptide ligand may be partially sequenced (by standard amino acid 
sequencing techniques). From this partial amino acid sequence, a partial 
nucleic acid sequence is deduced which allows the preparation of primers for 
PCR cloning of the ligand gene (e.g., by the method of Ausubel et al., supra V 
Identification of CAR Receptor DNA Rinding Sites 

Identification of the CAR receptor facilitates identification of its 
DNA binding site(s). According to one approach, CAR receptor DNA binding 
sites may be idenfified using a transactivation assay, e.g., as described above 
for the identification of the binding site of sequence 
GGGTAGGGTTCACCGAAAGTTCACTCG (SEQ ID NO: 5). Briefly, 
candidate DNA binding sites are inserted upstream of a target gene (whose 
expression may be assayed, e.g., those genes described above) and the ability of 
a CAR receptor polypeptide to bind the DNA site is assayed as its ability to 
acfivate downstream gene expression. 

Alternatively, a DNA binding site may be identified by selectively 
retaining a receptor-bound DNA fragment on a nitrocellulose filter. This 
approach relies on the ability of nitrocellulose to bind proteins but not double- 
stranded DNA. Purified CAR receptor polypeptide (e.g., purified by standard 
techniques from cells engineered to express the receptor, e.g., those described 
above) is mixed with labelled double-stranded DNA (e.g., a random pool of 
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DNA fragments ) under conditions which allow interaction. After incubation, 
the mixmre is suction filtered through nitrocellulose, allowing unbound DNA 
to pass through the filter while retaining the protein and any DNA specifically 
bound to It. Bound DNA fragments are then eluted from the filter and analyzed 
5 by gel electrophoresis or ampHfication and cloning. A detailed description of 
this technique is published in Ausubel et al., supra ). 

Candidate DNA fragments for either approach may be derived from 
a randomly cleaved or sonicated genomic DNA library and'or may be derived 
from known nuclear hormone response elements (see, e.g., Evans et al, 
10 WO90/11273). 

lucnLification of CAR receptor DNA binding sites tacilitates a search 
for the presence of such sites upstream of known or yet unidentified genes (e.g., 
by an examination of sequences upstream of known genes or by standard 
hybridizafion screening of a genomic library with binding site probes). CAR- 

15 mediated transcriptional control of genes bearing the binding site upstream may 
then be investigated (e.g., by transactivation experiments as described above), 
potentially leading to the elucidation of novel CAR receptor functions. 
Chimeric Receptors 

The fimctional domains of the CAR receptor may be swapped with 

20 the domains of other members of the nuclear hormone receptor family (see, 
e.g., Evans et al., WO 90/1 1273; Evans, Science 240:889, 1988) in order to 
produce receptors having novel properties. For example, if the DNA binding 
domain of the glucocorticoid receptor were fused to the gene activation domain 
of the CAR receptor, a novel receptor would be produced which could bind 

25 genes bearing an upstream glucocorticoid response element and activate gene 
expression in the absence of hormone. Conversely, ftision of the CAR DNA 
binding domain to the ligand-binding and gene activation domains of 



-38- 

glucocorticoid receptor would confer hormonal regulation on genes 
downstream of CAR bmdmg sites. Fmally, fusion of the CAR DNA binding 
domain to a trans-repressmg domain ( see, e.g., Evans et al., WO90/14356) 
would result in repression of the basal level of expression of genes bearing 
5 upstream CAR binding sites. Construction of receptor fusion genes is carried 
out by standard techniques of molecular biology. CAR receptor domains are as 
follows: DNA binding domain, approximately amino acids 11 -76; and gene 
activation and potential ligand binding domain, approximately amino acids 76- 
348. Examples of receptor domains which may be included in a chimeric CAR 

10 receptor are described in Evans et al. (WO 90/15815) and in Evans et al. 
(Science 240:S89, 1988). 
Dominant Negative Mutants 

Mutants of the CAR receptor may be generated which interfere with 
normal CAR receptor activity. Such mutants are termed "dominant negative" 

15 and fall into at least two classes: (a) ones which bind to their DNA binding site 
( thereby interfering with the ability of wild-type receptor to bind the same site) 
and which do not activate gene expression and (b) ones which heterodimerize 
with other receptors (e.g., RXR) but which do not promote the biological 
response associated with the wild-type heterodimer. 

20 The first class of CAR dominant negative mutants include those 

receptor polypeptides which contain a wild-type DNA binding domain and a 
mutant gene activation domain. Such mutants are unable to transactivate a 
reporter gene (e.g., as measured using a CAT reporter gene with an upstream 
pRARE and the standard methods described above) but retain the ability to 

25 bind a CAR DNA binding site (as evidenced, e.g., by DNA footprint analysis 
using a pR.ARE DNA sequence; Ausubel et al., supra ). 
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The second class of CAR dominant negative mutants include those 
receptor polypeptides which contain a \vild-t>T3e heterodimerization domain. 
Such a mutant interacts with its heterodimer partner and disrupts the partner's 
function. In one particular example, a dominant negative CAR receptor 
5 polypeptide may be overproduced (e.g., by directing its expression from a very 
strong promoter); the abundant CAR receptor polypeptide forms heterodimers 
with cellular RXR protein, soaking up available RXR and thereby preventing 
RXR homodimer formation as w^ell as RXR heterodimer formation with other 
partner proteins (e.g., RAR, VT3R, and T3R). Wild-type CAR receptor 

1 0 nolvnpntirle mpiv "fiinrtion fic a HominQnt t^f^cTctfi\rf^ T-niitartt if* /^Tr<iT-r>T-r\i-ln#^c»>-I i^ 

this manner. However, a mutant CAR receptor lacking a gene activation 
domain (e.g., as identified above) and/br a DNA binding domain (e.g., as 
identified by DNA footprint analysis, above) is preferred. 

Any of the above mutants may be generated by any method of 

1 5 random or site-directed DNA mutagenesis (see, e.g., Ausubel et al,, supra ). 
Identification of Molecules which Modulate CAR Receptor Expression 

Isolation of CAR receptor genes also facilitates the identification of 
molecules which increase or decrease CAR receptor expression, and which 
may be useful as therapeutics, e.g., for treatment of cancers such as lung 

20 cancer, or for treatment of thyroid disorders such as Graves* disease. 

According to one approach, candidate molecules (e.g., peptide or non-peptide 
molecules found, e.g., in a cell extract, mammalian serum, or growth medium 
on which mammalian cells have been cultured) are added at varying 
concentrations to the culture medium of cells w^hich express CAR receptor 

25 mRNA (e.g., HepG2, JEG-3, or HeLa cells). CAR receptor expression is then 
measured by standard Northern blot analysis (Ausubel et al., supra) using CAR 
receptor cDNA as a hybridization probe. The level of CAR receptor expression 
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in the presence of the candidate molecule is compared to the level measured for 
the same cells in the same culture medium but in the absence of the candidate 
molecule. A molecule which promotes an increase or decrease in CAR 
receptor expression is considered useful in the invention. 
Anti-CAR Receptor Antibodies 

Human or murine CAR receptor (or immunogenic receptor 
fragments or analogues) may be used to raise antibodies useful in the invention; 
such polypeptides may be produced by recombinant or peptide synthetic 
techniques (see, e.g.. Solid Phase Peptide Synthesis, supra : Ausubel et aL, 
supra). The peptides may be coupled to a earner protein, such as KLH as 
described in Ausubel et al, supra . The KLH-peptide is mixed with Freund^s 
adjuvant and injected into guinea pigs, rats, or preferably rabbits. Antibodies 
may be purified by peptide antigen affinity chromatography. 

Monoclonal antibodies may be prepared using the CAR polypeptides 
described above and standard hybridoma technology (see, e.g., Kohler et aL, 
Nature 256:495, 1975; Kohler et aL, Eur, J. Immunol. 6:511, 1976; Kohler et 
al., Eur. J. Immunol. 6:292, 1976; Hammeriing et aL, In Monoclonal Antibodies 
and T Cell Hybridomas. Elsevier, NY, 1981; Ausubel et aL, supra V 

Once produced, polyclonal or monoclonal antibodies are tested for 
specific CAR receptor recognition by Western blot or immunoprecipitation 
analysis (by the methods described m Ausubel et aL, supra ). Antibodies which 
specifically recognize a CAR receptor polypeptide are considered to be useful 
in the invention; such antibodies may be used, e.g., in an immunoassay to 
monitor the level of CAR receptor produced by a mammal. 
Therapy 

Because a lack of retinoic acid receptor has been associated with the 
occurrence of lung cancer and because the CAR receptor polypeptide binds and 
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activates expression of the reiinoic acid receptor gene, it is likely that the 
administration of a CAR receptor polypeptide to a mammal may prevent or 
treat cancers, particularly lung cancer. Similar therapeutic results would be 
expected for administration of a ligand which stimulates CAR receptor activity. 
5 CAR receptor polypeptides may also find therapeutic use in the 

treatment of Graves disease, a disease resulting from an increase in thyroid 
hormone receptor function. RXR protein plays a role in thyroid hormone 
receptor expression. Accordingly, dominant negative CAR mutants w^hich 
heterodimerize with RXR protein (including overexpressed wild-type CAR 

10 receptor protein^ may act to decrease the cellular levels of available RXR. and 
thereby decrease thyTOid homiOiic receptor function. Again, iigands which 
increase heterodimerization efficiency could also be administered as a 
treatment for Graves' disease. 

To treat the above diseases, the appropriate CAR receptor 

1 5 polypeptide (or ligand) is administered as a therapeutic preparation (e.g., in 

physiological saline) m accordance with the condition to be treated. Ordinarily, 
it will be administered intravenously, at a dosage effective to increase retinoic 
acid receptor expression (as a cancer treatment) or effective to decrease thyroid 
hormone receptor function (as a treatment for Graves' disease). Alternatively, 

20 it may be convenient to administer the therapeutic orally, nasally, or topically, 
e.g., as a liquid or a spray. Again, the dosages are as described above. 
Treatment may be repeated as necessary for alleviation of disease symptoms. 

The methods of the invention may be used to reduce the disorders 
described herein in any mammal, for example, humans, domestic pets, or 

25 livestock. WTiere a non-human mammal is treated, the CAR receptor 

polypeptide or the antibody employed is preferably specific for that species 
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Other Emhodimentq 
Polypeptides according to the invention include the entire human or 
munne CAR receptor sequences (as described in Figs. 1 and 2; SEQ ID NOS:l. 
10, 1 1, and 12) as well as any analog or fragment of the human or murine CAR 
5 receptors which include either a DNA binding domain or gene activation 

domain; or which include a heterodimerization domain (as identified using the 
techniques described above). 

Polypeptides of the invention also include all mRNA processing 
variants (e.g., all products of alternative splicing or differential promoter 
10 utilization) as well as CAR receptor proteins from other mammals. 

Specific receptor fragments or analogues of interest include full- 
length or partial (see below) receptor proteins including an amino acid 
sequence which differs only by conservative amino acid substitutions, for 
example, substitution of one amino acid for another of the same class (e.g., 
1 5 valine for glycine, arginine for lysine, etc.) or by one or more non-conservative 
amino acid substitutions, deletions, or insertions located at positions of the 
amino acid sequence which do not destroy the receptors' ability to either bind 
DNA and activate transcription; or to interact with CAR receptor's 
heterodimerization partners (as assayed above). Analogs also include receptor 
20 polypeptides which are modified for the purpose of increasing peptide stability; 
such analogs may contain, e.g., one or more desaturated peptide bonds or D- 
ammo acids in the peptide sequence or the peptide may be formulated as a 
cyclized peptide molecule. 

Other embodiments are within the following claims. 
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Claims 

1 . Substantially pure CAR receptor polypeptide. 

2. The polypeptide of claim 1, compnsing an amino acid 
sequence substantially identical to the amino acid sequence shown in Fig. 2 

5 (SEQIDNO:12). 

3. A substantially pure polypeptide comprising a CAR receptor 
DNA binding domain and a CAR receptor ligand binding/ gene activation 
domain. 

4. The polypeptide of claim 3, wherein said DNA binding 

10 domain comprises a sequence substantially identical to ammo acids 21-86 of 
Fig. 2 (SEQ ID NO: 12), or a DNA binding fragment thereof 

5. The polypeptide of claim 3, wherein said ligand binding/gene 
activation domain comprises a sequence substantially identical to amino acids 
182-358 of Fig. 2 (SEQ ID NO: 12), or a gene activatmg fragment thereof 

1 5 6, The polypeptide of claims 1 , wherein said receptor is 

mammalian. 

7. The polypeptide of claim 6, wherein said receptor is murine. 

8. The polypeptide of claim 7, wherein said polypeptide 
composes the ammo acid sequence of SEQ ID NO: 12. 



1. 
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Substantially pure DNA which encodes a polypeptide of claim 



10. The DNA of claim 9, wherein said DNA encodes a munne 
CAR receptor. 

1 1 . The DNA of claim 10, wherein said DNA encodes the amino 
acid sequence of SEQ. ID NO: 12. 

12. A vector comprising the DNA of claim 9, said vector being 
capable of directing expression of the protein encoded by said DNA in a 
vector-containing cell. 

13. A cell which contains the DNA of claim 9. 

1 4. A method of producing a recombinant CAR receptor 
polypeptide or a fragment or analog thereof comprising: 

(a) providing a cell transformed with DNA encoding a CAR receptor 
or a fragment or analog thereof positioned for expression is said cell; 

(b) culturing said transformed cell under conditions for expressing 
said DNA; and 

(c) isolating said recombinant CAR receptor polypeptide. 

15. A substantially pure antibody which specifically binds a 
polypeptide of claim 1. 

1 6. A method of identifying a CAR ligand, comprising 
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(a) providing a nucleic acid sequence which encodes a CAR receptor 
polypeptide; 

( b) introducing into a host cell which is functionally deficient for 
CAR receptor (i) said nucleic acid which encodes said CAR receptor 
polypeptide and (ii) a reporter gene operably linked to a CAR receptor 
polypeptide binding site; 

( c ) measuring induction of said reporter gene in said transfected host 

cell; 

(d) contacting said transfected host cell with a candidate ligand; and 
(e ) measuring induction of said reporter gene in the presence of said 
candidate hgand, an increase or decrease in said induction as compared to the 
induction in (c) being indicative of the presence of a CAR ligand. 

17. The method of claim 16, wherein said CAR receptor 
polypeptide comprises an amino acid sequence substantially identical to the 
amino acid sequence shown in Fig. 1 (SEQ ID NO: 10) or Fig. 2 ( SEQ ID 
NO:12). 

1 8. The method of claim 16, wherein said CAR receptor 
polypeptide binding site is GGGTAGGGTTCACCGAAAGTTCACTCG (SEQ 
ID NO: 5). 

19- A method of identifying a CAR ligand, comprising 

(a) providing a nucleic acid sequence which encodes a CAR receptor 
polypeptide; 

(b) introducing said nucleic acid into a host cell such that said 
recombinant CAR receptor polypeptide is expressed; 
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(c ) isolating said recombinant protein; 

(d) immobilizing said recombinant protein on a solid substrate; 

(e) contacting said immobilized recombinant protem with a candidate 
ligand under conditions which allow formation of an affinity complex between 
said immobilized recombinant CAR receptor polypeptide and said candidate 
ligand; and 

(f) detecting complex formation as an indication of the presence of a 
CAR ligand. 

20. The method of claim 19, wherein said CAR receptor 
polypeptide comprises an amino acid sequence substantially identical to the 
amino acid sequence shown in Fig. 1 (SEQ ID NO: 10) or Fig. 2 (SEQ ID 
NO: 12). 

2 1 , The method of claim 20, wherein said solid substrate is a 

column. 
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GTGAGCTTGC TCCTTAAGTT ACAGGAACTC TCCTTAT;^„\T AGACACTTCA TTTTCCTAGT 
CCATCCCTCA TGAAAAATGA CTGACCACTG CTGGGCAGCA GG AG GG AT G A TAATCCTAAC 
TCCAATCACT GGCAACTCCT GAGATCAGAG GA.=^-»\CCAGC AACAGCGTGG GAGTTTGGGG 
AGAGGCATTC CATACCAGAT TCTGTGGCCT GCAGGTGACA TGGTGCCTAA GAGAAGGAGG 
AGTCTGTGAC AGCCACCCCA ACACGTGACG tc 

ATG GCC AGT AGG GAA gat gag CTG AGG i-\C TGT GTG GTA TGT GGG GAG 
Met Ala Ser Arg Glu Asd Glu Leu Arg Asa C\'s ^'al Val Cys Giy Asp 
1 5 10 ' 15 

CAA GCC ACA GGC TAG CAC TTT AA.T GCC CTG ACT TGT GAG GGC TGC AAG 
Gin Ala Thr Gly Tyr His Phe Asn Ala Leu Thr Cys Glu Gly Cys Lys 
20 25 30 

GGT TTC TTC AGG AGA ACA GTC AGC AAA AGC ATT GGT CCC ACC TGC CCC 
Gly Fhe ?he Arg Aivg Thr Vdl Sgt Lys Ser He Glv Pro Tnr Cvs Pto 
35 40 ' 45 * 

TTT GCT GGA AGC TGT GAA GTC AGC AAG ACT CAG AGG CGC CAP TGC cCA 
Phe Ala Gly Ser Cys Glu Val Ser Lys Thr Gin Arg Arg His Cys Pro 
50 55 60 

GCC TGC AGG TTG CAG AAG TGC TTA GAT GCT GGC ATG AGG AAA GAC ATG 
Ala Cys Arg Leu Gin Lys Cys Leu Asp Ala Gly Met Arg Lys Asp Met 
65 ' 70 ' ' 75^ 80 

ATA CTG TCG GCA GAA GCC CTG GCA TTG CGG CGA GCA AAG CAG GCC CAG 
lie Leu Ser Ala Glu Ala Leu Ala Leu Arg Arg Ala Lys Gin Ala Gin 
85 90 95 

CGG CGG GCA CAG CAA ACA CCT GTG CAA CTG AGT AAG GAG CAA GAA GAG 
Arg Arg Ala Gin Gin Thr Pro Val Gin Leu Ser Lvs Glu Gin Glu Glu 
100 105 110 

CTG ATC CGG ACA CTC CTG GGG GCC CAC ACC CGC CAC ATG GGC ACC ATG 
Leu lie Arg Thr Leu Leu Gly Ala His Thr Arg His Met Gly Thr Met 
115 120 125 

TTT GAA CAG TTT GTG CAG TTT AGG CCT CCA GCT CAT CTG TTC ATC CAT 
Phe Glu Gin Phe Val Gin Phe Arg Pro Pro Ala His Leu Phe lie His 
130 135 140 

CAC CAG CCC TTG CCC ACC CTG GCC CCT GTG CTG CCT CTG GTC ACA CAC 
His Gin Pro Leu Pro Thr Leu Ala Pro Val Leu Pro Leu Val Thr His 
145 150 155 160 

TTC GCA GAC ATC AAC ACT TTC ATG GTA CTG C^-J-. GTC ATC AAG TTT ACT 
Phe Ala Asp lie Asn Thr Phe Met Val Leu Gin Val He Lys Phe Thr 
165 170 175 



FIG. 1 

(PAGE 1 OF 2) 



2/11 



AAG GAC CTG CCC GTC TTC CGT TCC CTG CCC ATT GAA GAG GAG ATC TCC 
T.yc; A<^n T.ian p-rn V;=». 1 php Ar'^ Ser L'='' Pr-'^- I ^_ r, n G\'^ lie Ser 

IBO 185 " 190 

CTT CTC AAG GGA GCA GCT GTG GAA ATC TGT CAC ATC GTA CTC AAT ACC 
Leu Leu Lys Gly Ala Ala Val Glu lie Cys His He Val Leu Asn Thr 
195 200 205 

ACT TTC TGT CTC CAA ACA CAA AAC TTC CTC TGC GGG OCT CTT CGC TAG 
Thr Phe Cys Leu Gin Thr Gin Asn Phe Leu Cys Gly Pro Leu Arg Tyr 
210 215 220 

ACA ATT GAA GAT GGA CCC CGT GTG GGG TTC CAG GTA GAG TTT TTG GAG 
Thr He Glu Asp Gly Ala Arg Val Gly Phe Gin Val Glu Phe Leu Glu 
225 230 235 240 

TTG CTC TTT CAC TTC CAT GGA ACA CTA CGA ;^J^A CTG CAG CTC CAA GAG 
Leu Leu Phe Hxs Phe His Gly Thr Leu Arg Lys Leu Gin Leu Gin Glu 
245 250 255 

CCT GAG TAT GTG CTC TTG GCT CCC ATG GCC CTG TTC TCT CCT GAC CGA 
Pro Glu Tyr Val Leu Leu Ala Ala Met Ala Leu Phe Ser Pro Asp Arg 
260 265 270 

CCT GGA GTT ACC CAG AGA GAT GAG ATT GAT CAG CTG CAA GAG GAG ATG 
Pro Gly Val Thr Gin Arg Asp Glu He Asp Gin Leu Gin Glu Glu Met 
275 280 285 

GCA CTG ACT CTG CAA AGC TAG ATC AAG GGC CAG CAG CGA AGG CCC CGG 
Ala Leu Thr Leu Gin Ser Tyr He Lys Gly Gin Gin Arg Arg Pro Arg 
290 295 300 

GAT CGG TTT CTG TAT GCG AAG TTG CTA GGC CTG CTG GCT GAG CTC CGG 
Asp Arg Phe Leu Tyr Ala Lys Leu Leu Gly Leu Leu Ala Glu Leu Arg 
305 310 315 320 

AGC ATT AAT GAG GCC TAG GGG TAG CAA ATC CAG CAC ATC CAG GGC CTG 
Ser He Asn Glu Ala Tyr Gly Tyr Gin He Gin His He Gin Gly Leu 
325 330 335 

TCT GCC ATG ATG CCG CTG CTC CAG GAG ATC TGC AGC TGA GGCCATGCTC 
Ser Ala Met Met Pro Leu Leu Gin Glu He Cys Ser 
340 345 

ACTTCCTTCC CCAGCTCACC TGGAACACCC TGGATACACT GGAGTGGGAA 

AATGCTGGGA CC.AAAGATTG GGCCGGGTTC AAAGGGAGCC CAGTGGTTGC AATGAAAGA 

TAAAGCAAAA C 
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C'l'Z^'I'^l^l^l' CC AGGO A.GT*0 AGOAC CGs— AGTi— — CT A-r^'r^GCT'Gvj\_*-\ j1*TG CTG AG A « C ^ 
I ^ ^ ^ ^ ^ ^ 60 

G AACA-A-AAGOTCGGT^ AC*TG CT^OG'w^ i ^r^vJ^»J^JrtT*Ti-^-rt.GGrtCC j » ^r^AoOACTCT.—to^^T^ 



61 * * 120 

C CT*'T^ G GT' C G G AGTC G C T C CT C C GG A G C C C T C TC C TC C G Av-rG rtT\jTGG G 'TT* AG AA.C A C C 



G C G^ G G C T AAtGOG AA-AG AGG AG A G G ATX^ AC A G G 'P A'TG G AA. C A C T* A G A_-'_A G C AT" G G G G 

121 ^ + * 13C 

GG AGGAG'GG AT^GGG'X*X"T'GTG GTGTGsj * rtGTG ^ GGATACGAT*TGTG*— . j. CPG^TGGTAGv— GG 

M T A M- L T L E T M A 

AGTGAACyAAGA-ATA'I^GGGG '^^rtvjvjj-Lj-'.^'XXj x >j'TG\j'PG * G'TG^Jrt*or\CC^J^J^J ^ C ACr^Gw^T'.*\ x 
131^ + + ^ + 240 

SZEEYGPRNCVVCGDRATGY 

241 H, ^ + + + -^ 300 

gtaaj^gctgcgggactg^j\cactccggaggttggcgaag;^j^.gtctggttgtcagtcgttt 

HFHALTCEGCKGFFRR TVSK 

AGCA.TTG-GTCCCATCTGTCCGTTTGCTGGAAGGTGTGAGGTCAGCJ^-AGGCGCAGAGACGC 

2QI + + ^ + + 360 

t^j^. t 1 ^ n a g<3g t ag a.c a.g-g g a_aa.c g ac c tt c c ac a.c t c c a.g g c g t t c c ggg t c t c tg c g 

TIGPI CPFAGRCEVSKAQRE 

CAGTGTGCAGCCTGCAGGTTGCAGA_AGTGTGTAAATGTTGGCATGAGGA^-AGAGATGATA 

361 + + * + * 420 

GTGACAGGTCGGACGTCCAACGTCTTGAGAGATTTACAACCGTACTCGTTTCTGTAGTAT 

HGPACRLQKCLKVGMRKDMI. 

CTGTCAGGAGAAGCCCTGGGATTGGGGCGAGGCAGACAGGGAGAGGGGCGGGCAGAGAA-A 

421 * + * ^SZ 

G AG AGGGGTCTTCGGG AGCGTAA.GGG GGCTGG^ *C *G*C'. — j.^ - ■ — ^ ^ x * * * 



I- J — r% xj J-k. — . - ' - r\ . - • ' - *^ — 

GG ATGTTTGC AA.CTGAATCAGCAGG AG AAAGAA.CTGGTGG AG ATGGTGGTGGGGGGG GAG 
^ ^ ^ ^ * 54 0 

ASLQLHQQQKELVQILLGAH 

FIG. 2 

^PaC^F 1 OF X\ 



4/n 

ACTCGCCATtjTXjGOCGCATTGT^TGACCAGTT'TGTCjC AGTTCAAGCCTC CAGCCTATCTG 
541 -t- ^ * -t- 60 0 

^TV" ■* r^T^ ^rrt ■» > •» ■» ■» /— 'rr-^y^ >»-r\/— • ■> > •* ^ •» ^ ^ -» > ^ i I'w ■ » ^ ^ ■» ^ ^ „ ^ /-^ ■> — . 



TRKVGPLFDQFVQFK 



P ? A y 



TTCATGGATCACCGGCCTTTCCAGCCTCGGGGCCCCGTGTTGCCTCTGCTCACACA-CTTT 

601 1 ■< ^ r 66 0 

AA-GTACGTAGTGGCCGGAAAGGTCGGAGCCCCGGCGCACAACGGAGACGAGTGTGTGAAJi. 



F M K H R P F Q P R G P V L P L L 



GCAGATATCAACACGTTTATGGTGCAA.CAGATCATCA-AGTTCACCAAGCATCTGCCGCTC 

651 + + + + -y- + 72 0 

CGTCTATAGTTGTGCAAATACCACGTTGTCTAGTA.GTTCAAGTGGTTCCTAGACGGCGAG 

ADirJTFMVQQIIKFTKDLPL 

TTCCGGTCCCTAACCATGGAGGACCAGATCTCCCTtctcaagggagcggctgtggaaata 

721 + + + + -r + 780 

AAGG C G AGGG ATTGGT ACCTC GTGGTCTAG AGGG Aa g a g C t c c c t c g c c g a c a c c t: 1 1 a t 

FRSLTMEDQISLLKGAAVEI 

ttgcatatctcactcaacactacgttctgtcct-caaacagagaat ttct tctgtgggcc t 

7S1 + + + + ^ ^ 840 

aacgtatagagtgagttgtgatgcaagacagaagtttgtctcttaaagaagacacccgga 



LH ISLNTTFCLQTENFFCG 



D 



ctt tgctacaagatggaggacgcagtccatgcagggttccagtacgagtittititggagt eg 

841 + + + + + + SOO 

gaaacgatgt t ctacctcctgcgtcaggtacgtcccaaggt: cat gct-caaaaacct cage 

LCYKMEDAVHAGFQYEFLES 

atcctccacttccataaaaacctgaaaggaccgcatctccaggagcctgagcatigtigctc 

901 + + + + + + 950 

taggaggtgaaggtatttt tggact t ccctgacgtagaggt cctcggact catacacgag 

ILKFHKNLKGLHLQEPEYVL 

ATGGGTGGCACGCCGCTCTTGTGCGGTGAGAGAGCCGGAGTTACCGAA-^.GAGAAGAGATA 
. ^ ^ ^ -".^nn 

TACCGAGGGTGGGGGGAGAA.GAGGGOACTGTGTGGGGGrC.--\TGGGTTTGTCTTCTCTAT 

MAATALFSPORPGVTQREEI 

GATCAGCTACAAGAGGAGATGGCGCTGATTCTGAAGAA-GGAGATTATGGAAGAAGAGTCT 

3_Q2i ^ ^ ^ + * ^ 1080 

CTAGTGGATGTTGTGGTCTACGGCGACTAAGACTTGTTGCTGTAJ^TACCTTGTTGTGAGA 



DQLQEEMALILrJNHlMEQQS 
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CGGCTCCAAAGTCSGTTTCTGTATGCAJ^J^GCTGATGCCCCTGCTGGCTGACCTCCGGAGT 



lOSl 



GCCGAGGTTTCAGCCAAAGACATACGTTTCGACTACCCGCACGACCGACTGOAGGCCTGA 
R L Q S R F L Y A K L M G L L A D L R S 
ATaaacaa t gca tac tccCa t gaac t ticagcgctcggaggaactdwCrcTccati'^a^^'-"'-' 

1141 ^ ^ ^ ^ — !-!!::! 1200 

TAcrrgnracgcatgaggatactCgaagtcgcgaacctccttgacagacgaractgcgcc 

IMNAYSYELQRLEELSAMTP 

ctgcrcggggagatttgcagttgaggcccaggcttgcatccttitccccagacccccagcg 
1201 ^ . . . 

cacgagcccctc::aaacgt:caactczgggtccgaacgcaggaaacgggt:ct:gggggtccc 
LLGEICS* 

atacactggcccggaaagggtacagcgctggaccccacacggcaaccagcaggaaggagc 
1261 ^ + + ^ ^ 1320 

tatgtgaccggacctttcccatgtcgcgacctggggcgcgcccttggtcgcccttcct ccj 



ttgggagrggcaatgaaatcctgaacagtcaaaaaaaaaaa 

1321 + + + ^- 1361 

aaccctcaccgttactttacgacttgtcagttttctttttt 
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OCX?. 



1 X T «TAx:.TLrr> 



RD.£L Q H ST 



^ M T 2 R H I C L?TLA V L 



181 QIIXJ-rJOL?! 



CV ^QL RTICA 



7 

RV V 1-L7 CT IOC Q L m d 



2il H;craY=rLZ5rL.MrTCrLxci_-cLcz? 
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d 
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;liO> The Genercil Hcspit: 

• 1 2 I^' > CAR F.ECEPTCF-S A^sil^ r 
METHODS 

- 1 2 C > C 0 7 8 6 / 1 2 5 W04 

14 C> Net Kr.cwn 



■ 1 5 Ci > 
■: 1 5 1 > 



3"/843 ,350 
1992-02-26 



:1EC)> 0 8/ 934,388 

:1E1> 1997-09-19 

:160> 19 

;1"'C> FastiSEQ for Windows 



: 10-. 



^ — i- - 

;213; 



145 0 
DNA 

Komo sap X ens 



< 4 0 0 > 
g tgagcct: gc 
ccatcccr ca 
tccaaccac: 
aaaggcat 
agtctgcgac 
acrgtgcgut 
gctgcaaggg 
ctggaagc -g 
agcgcttaga 
ggcgagcaaa 
aagaagagct 
aacagcttgt 
ccctggcc rc 
cgcaagccac 
agat ct ccct: 
tccgtctc=a 
cccgcgcggg 
gaaaaccgca 
ctgaccgacc 
cgactctgca 
cgaagt tgct 
t ccagcacat: 
gcca cgct ca 
acgcr gggac 
aaaacaaaac 



tc cttaagt t 
tgaaaaatga 
ggcaactcot: 
carraccagan 
agccacccca 
atgtggggac 
ttitcttcagg 
tgaagtcagc 
cgctggcatg 
gcaggcccag 
gatccggaca 
gcagtttagg 
tgtgctgcct 
caagnttact 
t r tcaaggga 
aacacaaaac 
gc tccaggta 
gctccaagag 
nggagttacc 
aagctacacc 
aggcctgcrg 
ccagggcctg 
ct tccttccc 
caaagacrgg 



acaggaactc 
ctgaccacr g 
gagaccagag 
tctgtggcct 
acacgcgacg 
caagccacag 
agaacagr ca 
aagactcaga 
aggaaagaca 
cggcgggcac 
ct cct: ggggg 
cct ccagc -c 
ctggtcacac 
aaggacctgc 
gcagctigtgg 
cncccctgcg 
gagttttrgg 
cccgagtacg 
cagagagacg 
aagggccagc 
gctigagctcc 
cct gccacga 
cagctcacct 
gccgggt tea 



r cct: tanaac 
c t gggcagca 
gaaaaccagc 
gcaggt gaca 
tcacggccag 
gctaccactt 
gcaaaagcat 
ggcgccaccg 
tgatactgt c 
agcaaacacc 
cccacacccg 
atctgt ccat 
act tcgcaga 

aaatctgt ca 
ggcct ct teg 
agt tgct ct t 
tgct ettggc 
agat tgatca 
agcgaaggcc 
gqaqcattaa 
tgccgctgc t 
ggaaeaecct 
aagggagecc 



agacacttca 
gqagggatga 
aacagcgtgg 
rgc tgcct aa 
tagggaagat 
taatgcgetg 
tggrcccacc 
cccagcctgc 
ggcagaagcc 
cgtgcaactg 
ccacatgggc 
ccat caccag 
catcaacac t 

catcgtact c 
ct acacaat t 
tcacttcca t 
tgccatggec 
gc tgcaagag 
ccgggat egg 
tgaggcctac 
ccaggagat c 
ggatacactg 
agtggttgca 



tt ttcctagt 
taat cc t aac 
gagt t tgggg 
gagaagcagg 
gagctgagga 
ac t tgtgagg 
tgcccctttg 
aggt tgcaga 
ctggcattgc 
agtaaggagc 
accatgt t tg 
ccct tgccca 
tncatggtac 
at tgaagacc 
aataccactt 
gaaga tggag 
ggaacactac 
etg 1 1 ctct c 
gagatggcac 
tttctgtatg 
gggtaccaaa 
tgcagctgag 
gagcgggaaa 
atgaaagact 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
84 0 
900 
960 
102 0 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
14 50 



210;> 2 

211> 26 

212> DNA 

213-^ Home saDier.E 



c 4 G 0 > 1- 

-979^999^^ gy3.aggsnct ytt;/mg 



26 



< 2 1 0 > 3 

<211> 28 

<212> DNA 

<213> Ratitus Sf 



laaaa teaaaaacar aaccgc£ 



<:210> 4 
< 2 1 1 > 31 
^ 2 1 2 > DNA 

<213> Horr.o sapier.s 
<400> 4 



SUBSTITUTE SHEET (RULE 26) 



5NSCOC;D <wc gg^ssssAi ■_> 



w o 99/15555 



2 



PCTUS98/19365 



2 1 0 > 5 
<, ~ 1 1 Z ^ 

<Z12:> Rat-US sp 

gggragggcr caccgaaagc rcactrg 

<: 2 1 C > 6 

2 " 1 "> 2 S 

c 2 1 1 > DNA 

-.2 13 > Ratcus sp 

<4 00> 6 

aaaggraaaa ccagggacgt gaccrcag 

<210> 7 

c211> 30 

<212> DNA 

<:2 1 3 > Rartus sp 

< 4 0 Q > 

ccggaggtiga caggaggaca gcagccccga 

<210> 8 
<211> 44 

<213:> Ractus sp 

c400> e 

agacaggttg acccctrttiz caagggctca acccagccca ccug 

<210> 5 

<:2 11 > 2. S 

<: 2 1 2 DNA 

<213> Homo sapiens 

<400> 5 

aggacqt rgg ggt tagggga qgacagtg 

<2 10 > 10 
<211> 343 
<212> PRT 

c400> 10 

Met A' a Ser Arg Glu Asp Glu Leu Arg Asn Cys Val Val Cys Gly Asp 

1 ^ 5 10 15 

Gin A^a Thr Glv Tvr His Phe Asn Ala Leu Thr Cys Glu Gly Cys Lys 

20' ' 25 30 

Gly Phe Phe Arg Arg Thr Val Ser Lvs Ser lie Gly Pro Thr Cys Pro 

35 40 45 

Phe Ala Gly Ser Cvs Glu Val Ser Lys Thr Gin Arg Arg His Cys Pro 

5 0 ^ 5 5 6 0 

Ala Cvs Arq Leu Gin Lvs Cvs Leu Asp Ala Gly Met Arg Lys Asp Met 
65 ' 70 ' 75 80 

Leu Ser Ala Glu Ala Leu Ala Leu Arg Arg Ala Lys Gl:i Ala Gin 
85 90 55 

Arq A--q Ala Gin Gin Thr Pro Val Gin Leu Ser Lys Glu Gin Glu Glu 

100 105 110 

Leu lie Arg Thr Leu Leu Gly Ala His Thr Arg His Met Gly Thr Met 

-he G^ u Gin Phe Val Gin Phe Arq Pro Pro Ala His Leu Phe He His 

130 135 140 

His Gin Pro Leu Pro Thr Leu Ala Pro Val Leu Pro Leu Val Thr His 

145 150 155 ^ 160 

Pne A_a Asp He Asn Thr Phe Met Val Lea Gin Val He Lys Phe Thr 

165 1^0 175 

Lvs Aso Leu Pro Val Phe Arg Ser Leu Pro He Glu Asp Gin He Ser 

180 ^ 185 150 

Leu Leu Lvs Gly Ala Ala Val Glu He Cys His He Val Leu Asn Thr 

200 205 

Thr Phe C'/s Leu Gin Thr Gin Asn Phe Leu Cys Gly Pro Leu Arq Tyr 

">^c 22 c 

Thr I^e Glu Asp Giy A-^d Miy va^ oly rhe -^In Val 3^^ Ti.-- L^u Glut 



28 




<210> 11 
<211> 1361 
<212> DKA 
<213> Mus sp 

<400> 11 

czzc:zziicc: aagcactgag ca-ccaa^gz" 'zzzs.b.zz^cz ggcagtLcct gag at etc 

ggaaagcagg gtcagcgagg aggcctgggg agaggaggca tcctacaccc aatcttgtgg 120 

cctgctgcct aagggaaaca ggagaccatg acagctatgc taacactaga aaccatggcc 180 

agtgaagaag aacacgggcc gaggaactgt gtggcgtgtg gagaccgggc cacaggctat 240 

cacttccacQ ccctgacttg tgagggctgc aagggcttzt tcagacgaac agtcagcaaa 300 

accactggcc ccatctgtcc gttcgctgga aggtgtgagg tcagcaaggc ccagagacgc 360 

cactgcccag cctgcaggtt gcagaagtgt ctaaatgtcg gcacgaggaa agacatgata 420 

ctgtcagcag aagccctggc attgcggcga gccagacagg cacagcggcg ggcagagaaa 480 

gcatctttgc aactgaatca gcagcagaaa gaactggtcc agatcctcct gggggcccac 540 

actcaccata tgqgcccatt gtttgaccag tttgtigcagt tcaagcctcc agcctatctg 600 

cccatgcatc accggccttc ccagcctcgg ggccccgtgt cgcctctgct cacacacttt 660 

gcagatatca acacgtLtat ggtgcaacag actatcaagt tcaccaagga tccgctgctc 720 

ttccggtccc taaccatgga ggaccagatc tcccttccca agggagcggc tgrggaaata 780 

ttgcatatct cactcaacac cacgtcctgt cttcaaacag agaatttctt ctgtgggccr 840 

ctttgctaca agatggagga cgcagtccat gcagggttcc agtacgagtt tttggagtcg 900 

accctccact tccataaaaa cctgaaagga ctgcatctcc aggagcctga gtatgtgctc 960 

ac^rgctgcca cggccctctt ctcccctgac agacccggag ttacccaaag agaagagata 1020 

gatcagctac aagaggagat ggcgctgatt ctgaacaacc acattatgga acaacagtct 1080 

cagctccaaa gtcggtttct gtatgcaaag ctgatgggcc tgccggctga cctccggagt 1140 

ataaacaacg catactccta tgaacttcag cgrttggagg aactgtctgc tacgacgccg 1200 

ctgctcgggg agatttgcag ttgaggccca ggzttgcatc ctttccccag acccccaggg 1260 

atacactggc ctggaaaggg tacagcgccg ga=cccacac gggaaccagc aggaaggagc 1320 

ttggaagtgg caatgaaatg ctgaacagtc aaaaaaaaaa a 1361 

<210> 12 
<211> 35B 
<212> PRT 
<213> Mus sp 

<400> 12 



Met 


Thr 


Ala 


Met 


Leu 


Thr 


Leu 


Glu 


Thr 


Met 


Ala 


Ser 


Glu 


Glu 


Glu 


Tyr 


1 








5 










1 0 














Gly 


Pre 


Arg 


A.sn 


Cys 


Val 


Val 


Cys 


Gly 


Asp 


Arg 


Ala 


Thr 


Gj.y 


- y ^ 


His 






2 0 










25 










30 






Phe 


His 


Ala 


Leu 


Thr 


Cys 


Glu 


'-J J. y 


Cys 


i^'/3 




Phe 


Phe 


Arg 


Arg 


Thr 






3 B 










4 0 










4 5 








Vai 


Ser 


Lys 


Thr 


He 


Gly 


Pro 


lie 


Cys 


Pro 


Phe 


A^a 


Gly 


Arg 


Cys 


Glu 




Ser 


Lys 


Ala 


Gin 


Arg 


Arg 


His 


Cys 




A- J. a 


Cys 


Arg 


Leu 


G m 


hys 


6 5 










70 






















Cys 


Lieu 


Asn 


Val 


Gly 


Met 


Arg 


Lys 


Asp 


Met 


lie 


Leu 


Ser 


Ala 


Giu 


Ala 










85 










90 










q c 




Leu 


Al a 


Leu 


Arg 


Arg 


Ala 


A.rg 


Gin 


Ala 


Gin 


Arg 


Arg 


Ala 


±. u 


Lys 


Ala 








100 










1 0 E. 










110 






Ser 


Leu 


Gin 


Leu 


Asn 


Gin 


Gin 


Gin 


Lys 


Glu 


Leu 


Val 


Gin 


lie 


Leu 


Leu 


























125 












115 




























Gly 


Al a 


His 


Thr 


Arg 


His 


Val 


Gi y 


Fro 


Leu 


Phe 


Asp 


Gin 


Phe 


Val 


Gin 


Phe 


L%'" s 


Pro 


Pro 


Ala 




13 5 


Phe 


Me t 


His 


His 


Arg 




Phe 


n'f 


Pro 


14 5 










150 




















160 


Arg 


G^y 


Pro 


Val 


Leu 


Pro 




Leu 


Thr 


His 


Phe 


Ala 


Asp 


He 


Asn 


Thr 








165 










170 










175 




Phe 


Met 


Val 


Gin 


Gin 






j^y 3 


Phe 


Thr 


Lys 


Asp 


Leu 


Pro 


Leu 


Fhf^ 








150 




















190 






Arg 


Ser 


Leu 


Thr 


Met 


Gl u 


Asp 






Ser 


Leu 




Lys 




Ala 


Ala 
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19 = 



. al 


Glu 
21C 


lie 




His 


1 ^ e 


Ser 
215 


Leu 


A s n 




T r. r 


rne 
2 2 : 


lys 


Leu 




Thr 


31u 


A^n 


Fhe 


Phe 


Cys 


Glv 


Frc 


:_eu 


lys 


T\' r 


Lys 


Met 


Glu 


A s p 


Ala 


Val 


225 










2 3 0 










2 3 r: 










24 0 


Hl£ 


Ali 


■/ 


?he 


G 1 n 
24 5 


Tyr 


Glu 


Fhe 


Leu 


■2- 1 u 
2 5-. 


Ser 


lie 


Leu 


H 1 s 


? he 
255 


His 


2ys 


Asn 


Leu 


Lys 


G 1 y 


Leu 


His 


Leu 


G_n 


G« u 


Pi o 


G ^ u 




.'a ^ 


Le u 


Me ti 








26 0 










2c 5 










2 C 






Ala 


Ala 


Thr 

2 ':'5 


Ala 


Leu 


Fhe 


Ser 


Pro 


Asp 


A*r 9 


? r o 


T - ' 


V a ^ 

^ ^ 




3'' ^ 


Ar'3 


Glu 


Glu 


lie 


Asp 


Gin 


Leu 


Gin 


Giu 


Glu 


Met 


Ala 


Leu 


lie 


Leu 


Asn 


Asn 






























Ala 


His 


lie 


y.ez 


Glu 


Gin 


Gin 


Ser 


Ar^ 


Leu 


Gin 


Ser 


Arg 


Fhe 


Leu 


Tyr 


3 05 










3 10 










315 








Ala 


3 20 


:.ys 


Leu 


Met 


Gly 
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<210> 13 
<211> 18 
c212> DNA 
<213> Mu3 sp 

< 4 0 0 > 1 3 
ttcctaccca catacggc 

<210> 14 

^211> 2C 

c^2 12> DNA 

<2 13> Mus sp 

<400> 14 

gaccctgctt tcctitgagat 20 

<210> 15 
<211> 20 
<212> DNA 
<213> Mus Sp 

^400^ 1^3 

^^^^^^^^^^ 20 



^210^ IG 
c211> 23 
c:212> DNA 
<213> Mus sp 

<400> 16 

actgcaaatc tccccgagca gcg 23 

<210> 17 
<211> 25 
<212> DNA 
<213> Mus sp 

<400> 17 

taggaggtrg actagagcrc cttct 25 

<210> 18 

<211> 2-7 

<212> DNA 

<213> Mus sp 

< ! 0 0 > 13 

gnr. r.g^u.^iLa tLaCtaCtCCtCtCttC 27 

<:2 10> 19 

<211> 3 6 

c212> DNA 

';2 13> Mus sp 

■r 4 0 0 > 1 ? 

ugccrcagtg cct:ggaaaac uagggzctt- ti^ttgc 3-' 
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